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IDENTIFICATION OF NOVEL E2F TARGET GENES AND USE THEREOF 



The present invention concerns altering plant characteristics. More particularly, the present 
invention relates to identification of genes and proteins involved in E2Fa/DPa-mediated 
5 processes and further relates to use of such genes and proteins for altering characteristics in 
plants. 

The present invention concerns a method for altering one or more plant characteristics, 
whereby the altered plant characteristic is selected from altered development, altered plant 
growth, altered, for example increased, plant yield and/or biomass. biochemistry, physiology, 

10 architecture, metabolism, survival capacity or stress tolerance by modifying expression of one 
or more of the genes according to the present invention and/or by modifying levels and/or 
activity of the proteins encoded by these genes. The present invention also concerns genetic 
constructs for perfomiing the methods of the Invention and to plants or plant parts obtainable 
by the methods of the present invention, which plants have altered characteristics compared to 

15 their othenwise isogenic counterparts. The invention also extends to recombinant nucleic acids 
and the use thereof in the methods according to the invention. 

Growth, development and differentiation of higher organisms are controlled by a highly ordered 
set of events called the cell cycle (Morgan, 1997). Cell division and cell grovvth are operated by 

20 the cell cycle, which ensures correct timing and high fidelity of the different transition events 
involved. Cell cycle regulation at both G1->S and G2-^M phase transitions depends on the 
formation of appropriate protein complexes and both transitions are believed to be the major 
control points in the cell cycle. The cell's decision to proliferate and synthesize DNA and 
ultimately to divide is made at the G1-»S restriction point in late G1. Overcoming this point of no 

25 return requires the cell's competence to initiate DNA synthesis as well as the expression of S- 
phase genes. Transcription of S-phase-specific genes requires binding to the DNA of an E2F 
transcription factor. Dimerisation of E2F with DP is a prerequisite for high affinity binding to the 
E2F consensus DNA binding site (An-)TT(G/C)(G/C)C(G/G)(G/C) (SEQ ID NO 2775), for 
example (TTT(C/G)(C/G)CGC), that can be found in the promoters of genes involved in DNA 

30 replication, repair, checkpoint control and differentiation (Ren et al., 2002; Weinmann et al., 
2001; Kel et al., 2001). Variants of this consensus sequence as well as other locations of this 
consensus sequences are also found. The heterodimeric E2F/dimerization partner (DP) 
transcription factor also regulates the promoter activity of multiple genes, which are essential for 
DNA replication and cell cycle control (Helin. 1998; MQIIer and Helin. 2000). E2F transcription 

35 factors are critical effectors of the decision to pass the restriction point and to allow the cell to 
proceed in S-phase. 
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identification of genes under the direct control of E2Fa and/or DPa proteins and genes tliat are 
indirectly controlled by the E2Fa/DPa complex. Examples of mechanisms for such indirect 
control include, (i) recognition by E2F/DP of other sequence elements that diverge from the 
consensus recognition site; (ii) possible association of E2F/DP with other DNA binding proteins 
capable of recognizing other DNA elements; and (iii) sequential transcription activation of a 
first gene capable of regulating transcription of a second gene. It is to be understood that 
having an E2F target sequence is not a prerequisite to be regulated by E2F. 

The gene that corresponds to the sequence deposited under the MIPS database accession 
number At1g57680 is an example of a gene, which is likely to be indirectly controlled by the 
E2Fa/DPa complex. This gene is of unknown function. It was surprising to find this unknown 
gene and the other genes of Tables 1 , 2, 4 and 5 to be involved in E2Fa/DPa controlled 
processes. The genes according to the present invention are represented herein with their 
nucleic acid sequence and con-esponding amino acid sequence as set forth in SEQ ID NO 1 to 
2755. 



Preferably expression and/or level and/or activity of one of the genes and/or proteins according 
to any of SEQ ID NO 1 to 2755 is modified. Alternatively expression and/or level and/or activity 
of one or more of those genes and/or proteins is modified. According to a further embodiment 
one or more gene/and or proteins of the same functional category as presented in Table 1 or 
Table 2, are modified. 



The tenri "modifying expression" relates to altering level (increasing expression or decreasing 
expression) or altering the time or altering the place of expression of a nucleic acid. The term 
"modified" as used herein is used interchangeably with "altered" or "changed". 
Modified expression (or level or activity) of a sequence essentially similar to any one of SEQ ID 
NO 1 to 2755 encompasses changed expression (or level or activity) of a gene product, 
namely a polypeptide, in specific cells or tissues. The changed expression, activity and/or 
levels are changed compared to expression, activity and/or levels of the gene or protein 
essentially similar to any one of SEQ ID NO 1 to 2755 acid in corresponding wild-type plants. 
The changed gene expression may result from changed expression levels of an endogenous 
gene essentially similar to any one of SEQ ID NO 1 to 2755 acid and/or may result from 
changed expression levels of a gene essentially similar to SEQ ID NO 1 to 2755 acid 
•previously introduced into a plant. Similarly, changed levels and/or activity of a protein 
essentially similar to any one of SEQ ID NO 1 to 2755 acid may be due to changed expression 
of an endogenous nucleic acid/gene and/or due to changed expression of nucleic acid/gene 
previously introduced into a plant. 
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Modified expression of a gene/nucleic acid and/or Increasing or decreasing activity and/or 
levels of a gene product may be effected, for example, by chemical means and/or recombinant 



means. 



Advantageously, modified expression of a nucleic acid according to the invention and/or 
modified activity and/or levels of a protein according to the invention may be effected by 
chemical means, i.e. by exogenous application of one or more compounds or elements 
capable of modifying activity and/or levels of the protein and/or capable of modifying 
expression of a nucleic acid/gene according to the invention. The temi "exogenous application" 
as defined herein is taken to mean the contacting or administering of a suitable compound or 
element to plant cells, tissues, organs or to the whole organism. The compound or element 
may be exogenously applied to a plant in a fomi suitable for plant uptake (such as through 
application to the soil for uptake via the roots, or in the case of some plants by applying directly 
to the leaves, for example by spraying). The exogenous application may take place on wild- 
type plants or on transgenic plants that have previously been transformed with a nucleic 
acid/gene according to the present invention or with another transgene. 

Suitable compounds or elements include proteins or nucleic acids according to SEQ ID NO 1 
to 2755 or proteins or nucleic acids essentially similar to SEQ ID NO 1 to 2755. Essentially 
similar proteins or nucleic acids are. homologues, derivatives or active fragments of these 
proteins and/or portions or sequences capable of hybridizing with these nucleic acids. The 
exogenous application of yet other compounds or elements capable of modifying levels of 
factors that directly or indirectly activate or inactivate a protein according to the present 
invention will also be suitable in practicing the invention. These compounds or elements also 
include antibodies that can recognize or mimic the function of the proteins according to the 
present invention. Such antibodies may comprise "plantibodies", single chain antibodies, IgG 
antibodies and heavy chain camel antibodies, as well as fragments thereof. Additionally or 
alternatively, the resultant effect may also be achieved by the exogenous application of an 
interacting protein or activator or an inhibitor of the gene/gene product according to the present 
invention. Additionally or alternatively, the compound or element may be a mutagenic 
substance, such as a chemical selected from any one or more of: N-nitroso-N-ethylurea. 
ethylene imine, ethyl methanesulphonate and diethyl sulphate. Mutagenesis may also be 
achieved by exposure to ionising radiation, such as X-rays or gamma-rays or ultraviolet light. 
Methods for introducing mutations and for testing the effect of mutations (such as by 
monitoring gene expression and/or protein activity) are well known in the art. 
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Therefore, according to one aspect of the present invention, there is provided a method for 
altering plant characteristics, comprising exogenous application of one or more compounds or 
elements capable of modifying expression of a gene and/or capable of modifying activity 
and/or levels of a protein according to the present invention. 

5 

Additionally or alternatively, and according to a prefen^ed embodiment of the present invention, 
modified of expression of a nucleic acid and/or modified of activity and/or levels of a protein, 
wherein these nucleic acids or proteins are essentially similar to any of SEQ ID NO 1 to 2755, 
may be effected by recombinant means. Such recombinant means may comprise a direct 
10 and/or indirect approach! for modifying expression of a nucleic acid and/or for modifying activity 
and/or levels of a protein. 

Therefore there is provided by the present invention, a method to alter plant characteristics, 
comprising modifying gene expression and/or protein levels and/or protein activity, which 
15 modification may be effected by recombinant means and/or by chemical means and wherein 
said gene and/or protein are essentially similar to any one of SEQ ID NO 1 to 2755. 

An indirect recombinant approach may comprise for example introducing, into a plant, a 
nucleic acid capable of increasing or decreasing activity and/or levels of the protein in question 

20 (a protein essentially similar to any one of SEQ ID NO 1 to 2755) and/or capable of increasing 
or decreasing expression of the gene in question (a gene essentially similar to any one of SEQ 
ID NO 1 to 2755). Examples of such nucleic acids to be introduced into a plant, are nucleic 
acids encoding transcription factors or activators or inhibitors that bind to the promoter of a 
gene or that interact with a protein essentially similar to any one of SEQ ID NO 1 to 2755. 

25 Methods to test these types of interactions and methods for isolating nucleic acids encoding 
such interactors include yeast one-hybrid or "yeast two-hybrid screens. 

Also encompassed by an Indirect approach for modifying activity and/or levels of a protein 
according to the present invention and/or expression of a gene according to the present 
30 invention, is the provision of a regulatory sequence, or the inhibition or stimulation of regulatory 
sequences that drive expression of the native gene in question or of the transgene in question. 
Such regulatory sequences may be introduced into a plant. For example, the nucleic acid 
introduced Into the plant is a promoter, capable of driving the expression of the endogenous 
. gene essentially similar to any one of SEQ ID NO 1 to 2755. 

35 

A further indirect approach for modifying activity and/or levels and/or expression of a gene or 
protein according to the present invention in a plant encompasses modifying levels in a plant of 
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interference RNAs (s.RNAs), microRNA (miRNA). etc. Therefore according to a particular 
aspect of the invention, there is provided a method to alter characteristics of plants, including 
technologies that are based on for example the synthesis of antisense transcripts, 
complementary to the mRNA of a gene essentially similar to any one of SEQ ID NO 1 to 2755. 
Another method for downregulation of gene expression or gene silencing comprises use of 
ribozymes, for example as described in WO9400012 (Atkins et al.), WO9503404 (Lenee et al.), 
WO0000619 (Nikolau et al.), W09713865 (Ulvskov et al.) and W09738116 (Scott et al.). 
Gene silencing may also be achieved by insertion mutagenesis (for example. T-DNA insertion 
or transposon insertion) or by gene silencing strategies as described among others In the 
documents WO9836083 (Baulcombe and Angell). WO9853083 (Grierson et al.). W09915682 
(Baulcombe et al.) or WO9953050 (Waterhouse et a!.). 

Expression of an endogenous gene may also be reduced if the endogenous gene contains a 
mutation. Such a mutant gene may be isolated and introduced into the same or different plant 
species in order to obtain plants having altered characteristics. Also dominant negative 
mutants of a nucleic acid essentially similar to any one of SEQ ID NO 1 to 2755 can be 
introduced in the cell to decrease the level/and or activity of the endogenous corresponding 
protein. 



Other methods to decrease the expression of a nucleic acid and/or activity and/or level of 
proteins essentially similar to any one of SEQ ID NO 1 to 2755 in a cell encompass, for 
example, the mechanisms of transcriptional gene silencing, such as the methylation of the 
promoter of a gene according to the present invention. 

Modifying expression of the gene also encompasses altered transcript level of the gene. 
Altered transcript levels of a gene can be sufficient to induce certain phenotypic effects, for 
example via the mechanism of cosuppression. Here the overall effect of overexpression of a 
transgene is that there is less level and/or activity In the cell of the protein, which Is encoded by 
the native gene showing homology to the introduced transgene. 

Cosuppression is accomplished by the addition of coding sequences or parts thereof in a 
sense orientation into the cell. Therefore, according to one aspect of the present invention, the 
characteristics of a plant may be changed by Introducing into a plant an additional copy (in full 
or in part) of a gene essentially similar to any one of SEQ ID NO 1 to 2755 already present in a 
host plant. The additional gene may silence the endogenous gene, giving rise to a 
phenomenon known as co-suppression. 
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Any variant suitable for use in ttie metiiods according to tfie invention may readily be 
determined using routine techniques, such as by following tlie methods described in the 
Examples section by simply substituting the sequence used in the actual Example with the 
fragment to be tested for functionality. 

A first example of such variants are "homologues" of the proteins of the present Invention, 
which homologues encompass peptides, oligopeptides, polypeptides, proteins and enzymes 
having amino acid substitutions, deletions and/or additions relative to the protein in question 
and haying similar biological and functional activity as an unmodified protein from which they 
are derived. To produce such homologues, amino acids of the protein may be replaced by 
other amino acids having similar properties (such as similar hydrophobicity, hydrophiliclty, 
antigenicity, propensity to fonn or break a-hellcal structures or p-sheet structures). 
Conservative substitution tables are well known in the art (see for example Creighton (1984). 

The homologues useful in the method according to the invention may have at least 30%, 32%, 
34%, 36%, 38%, 40%, 42%, 44%, 46%, 48% or 50% sequence identity or similarity (functional 
identity) to the unmodified protein, alternatively at least 52%, 54%, 56%, 58% or 60% 
sequence identity or similarity to an unmodified protein, or alternatively at least 62%., 64%, 
66%), 68% or 70% sequence identity or similarity to an unmodified protein. Typically, the 
homologues have at least 72%, 74%, 76%, 78% or 80% sequence identity or similarity to an 
unmodified protein, preferably at least 81%, 82%, 83%, 84%, 85%, 86%. 87%, 88% or 89% 
sequence identity or similarity, further preferably at least 90%, 91%, 92%, 93%) or 94% 
sequence identity or similarity to an unmodified protein, further preferably at least 95% 96%, 
97%, 98%) or 99% sequence identity or similarity to an unmodified protein. This %> identity can 
be calculated using the Gap program in the WISCONSIN PACKAGE version 10.0-UNIX from 
Genetics Computer Group, Inc based on the method of Needleman and Wunsch (J. Mol. Biol. 
48:443-453 (1970)) using the set of default parameters for painvise comparison (for amino acid 
sequence comparison: Gap Creation Penalty = 8, Gap Extension Penalty = 2; for nucleotide 
sequence comparison: Gap Creation Penalty = 50; Gap Extension Penalty = 3). 
The percentage of identity can also be calculated by using other alignment program well 
known in the art. For example, the percentage of Identity can be calculated using the program 
needle (EMBOSS package) or stretcher (EMBOSS package) or the program align X, as a 
module of the vector NTI suite 5.5 software package, using the parameters (for example GAP 
penalty 5, GAP opening penalty 15, GAP extension penalty 6.6). 
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Two special forms of homology, orthologous and paralogous, are evolutionary concepts used 
to describe ancestral relationships of genes. The term "paralogous" relates to gene- 
duplications within the genome of a species leading to paralogous genes. The term 
"orthologous" relates to homologous genes in different organisms due to ancestral relationship. 
The term "homologues" as used herein also encompasses paralogues and orthologues of the 
proteins used In the methods according to the invention. 

A preferred homologue is a homologue obtained from a plant, whether from the same plant 
species^ or different. The nucleic acid may be isolated from a dicotyledonous species, 
preferably from the family Brassicaceae, further preferably from Arabidopsis thaliana. 

Suitable homologues for use in the methods of the present invention have been identified in 
the genomes of rice and maize. These homologues are represented by their Genbank 
accession numbers in Table 1 and 2. Other homologues, especially orthologues from other 
plant species, are identifiable by methods well known by a person skilled in the art. In silico, 
methods involve running sequence alignment programs with the sequence of interest as 
mentioned above. In vivo methods involve the DNA encoding the protein of interest and are for 
example PGR techniques using degenerated primers designed based on the sequence of 
interest, which is any one essentially similar to any one of SEQ ID NO 1 to 2755, or 
hybridisation techniques with at least part of the sequence of interest. 

"Substitutional variants" of a protein are those in which at least one residue in an amino acid 
sequence has been removed and a different residue inserted in its place. Amino acid 
substitutions are typically of single residues, but may be clustered depending upon functional 
constraints placed upon the polypeptide; insertions will usually be of the order of about 1-10 
amino acid residues, and deletions will range from about 1-20 residues. 

"Insertional variants" of a protein are those in which one or more amino acid residues are 
introduced into a predetemiined site in the protein. Insertions can comprise amino-terminal 
and/or carboxy-terminal fusions as well as intra-sequence insertions of single or multiple amino 
acids. Generally, insertions within the amino acid sequence will be smaller than amino- or 
carboxy-terminal fusions, of the order of about 1 to 10 residues. Examples of amino- or 
carboxy-terminal fusion proteins or peptides include the binding domain or activation domain of 
a transcriptional activator as used in the yeast two-hybrid system, phage coat proteins, 
(histidine)6-tag, glutathione S-transferase-tag, protein A, maltose-binding protein, dihydrofolate 
reductase, Tag»100 epitope, c-myc epitope, FLAG®-epitope, lacZ, CMP (calmodulin-binding 
peptide), HA epitope, protein C epitope and VSV epitope. 
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from the protein. Amino acid vanants of a protein may reaaiiy u .... ... ^ 

recombinant DNA manipulations. The manipulation of DNA sequences to p 
inse,on o..e,etion variants o,ap.otein are we„..».n,^^^^^^ 
making substitution mutations at predetermined srtes in DMA are ,^ kno 
«,e art and include «13 mutagenesis. T7-Gen in .^ro '"^^'^l'^'^'^^^^ 
QuickChange Site Directed mutagenesis (Stratagene, San Diego, CA), PGR mediated 
3 directed mutagenesis or other site^lirected mutagenesis protocols. 

The tern, "deriva^ves" o, a protein according to .tte present invention a. ^^^^ 
o,igopep«des, p*pep«es, proteins and enzymes which may ^^^^^'^^^ 
deletions or addons of naturally and -7^;;::: "d Tnl t.e 

S to the amino acid sequence o, ^ ^^^-o^-^ 1 protein c, the present 

accession numbers presented in Table 1, 2. 4 ana o. u ^„,„^p^ ^hich may 

— " — s ---r ^^^^^^^^ 

rnmnrise naturally occurnng altered, giycosyiaxeu, doyio , _ th<= 

occurring amino acid residues relative to the amino add sequence of a natural^-oocurr 
25 protein of tiie present invention. 

another vacant ^fu, in the methods of .e .esent inve.on 

protein essentially similar to any one of SEQ ID NO .^..^uous amino acid 

— ^ '° ^ Catr rirdrr:r 

30 residues of said protein, and that has retaineo ine oioiuy rp^idues of 

^ . Jdues. These fragments originate from protein translation, starting at intemal ATG codon 

" „eously, the method according to the — ^^^^^^^^^^ 

fragments of DNA or of a nucleic acid sequence. The term DNA fragment 
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be created artificially, such as for example by the techniques of EMS mutagenesis. Typically 
such variants are created with the purpose of breeding the altered plant characteristic 
according to the present invention in a plant. Alternatively, naturally mutated alleles are the 
subject of such selection and breeding programmes, wherein the allele capable of conferring 
5 altered plant characteristics to the plant are selected and plants containing such allele are 
used for further breeding the trait. 

Accordingly, the present invention provides a method for altering plant characteristics, using a 
splice variant or an allelic variant of a nucleic acid sequence according to any one of SEQ ID 
10 NO 1 to 2755. 



15 



20 



^5 



The term "plant characteristic" means any characteristic of a plant, plant cell or plant tissue 
described hereafter. These characteristics encompass but are not limited to, characteristics of 
plant development, plant growth, yield, biomass production, plant architecture, plant 
biochemistry, plant physiology, metabolism, survival capacity, stress tolerance and more. DNA 
synthesis, DNA modification, endoreduplication, cell cycle, cell wall biogenesis, transcription 
regulation, signal transduction, storage lipid mobilization, photosynthesis and more. 

The terni "altering plant characteristics" as used herein encompasses any change in said 
characteristic such as increase, decrease or change in time or place. According to a preferred 
embodiment of the invention, altering a plant characteristics encompasses improving the plant 
characteristic, such as for example increasing the plant characteristic (e.g. yield), or 
accelerating the characteristic (e.g. growth rate). 

"Growth" refers to the capacity of the plant or of plant parts to expand and increase in biomass. 
Altered growth refers amongst others to altered growth rate, cycling time, the size, expansion 
or increase of the plant. Additionally and/or alternatively, growth characteristics may refer to 
cellular processes comprising, but not limited to. cell cycle (entry, progression, exit), cell 
division, cell wall biogenesis and/or DNA synthesis, DNA modification and/or 
endoreduplication. 

"Yield" refers to the harvestable part of the plant. "Biomass" refers to any part of the plants. 
These terms also encompass an increase in seed yield, which includes an increase in the 
biomass of the seed (seed weight) and/or an increase in the number of (filled) seeds and/or in 
the size of the seeds and/or an increase in seed volume, each relative to corresponding wild- 
type plants. An increase in seed size and/or volume may also influence the composition of 
seeds. An increase in seed yield could be due to an increase in the number and/or size of 
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ra«o of the total biomass overtt,e yield of t,arvestable parts, such as seeds. 

* « r^iont that is involved in determining the 
■Plan, development" means any cellular process of a plant tha^ • ^ 

developmental fate o, a plant cell, in particular the ^^^^"^ a. 
p„,en.or cel, will develop. Typical plant characterlst,cs accord n * * ^ ^ 
therefore characteristics re*g to cellular processes ,,,e.prnen.. 
for example, morphogenesis, photomorphogenes.s, shoot ^ 
vegetative development, reproductive development, stem ^^"^^.IZZ senLnce, 
n,echanlsms involved in determining cell fate, pattern fom,at,on, d«erent,at,on, 
time of flowering and/or time to flowier. 

■Plant architectures as t^edhereln refers tothee.e.aU^^^^^^^^^ 

one or more structural features or a combina.ion ^;"'^^^'"^r.rrs.,.r.e., and 
„ include the shape, si.e, number, position ^^''^^^^ I ^ 
panernation of any cell, tissue or organ or groups of ° J 

including the root, stem, lea,, shoot, petiole, t^chome. flower^ pe«^ cambium, wood, 
pollen, cvule, seed, embryo, endospem, - ^'^^^^ amongst 

heartwood, parenchyma, aerenchyma, s.eve element, phloem 



20 others. 
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The term -stress toleran.- is understood as the capa ility -J^^'-'^^^^:^:, 
performing in stress condKions such as envircnh,ental stress, ^ 
SallnKv. drought, heat, chilling and freezing are all descnbed a ^^^^ 
induce osmotic stress. The term -environmenta, stress ^^"^^^'^^^^ organ or whole 
.0 any adverse effect oh metabolfem. growth or v,ab,l^ of the ^^^^ ' J^^^^ 
Plant Which is produced b, a — ^^^^^^^ 

particularly, it also encompasses environmental factors such aerobic stress, 

Lgglng, drought, dehydration), anaerobic (low ^l^^^l^^: ^^^1^1 " 
osmotic stress, salt stress, -"^^^^^'^ ^^^l^^^^^^'. o:^Z^^ it. pathogen 
deprivation, pollutants stress (heavy '"^^^^^ J ,p„,,„3t,ons of these. Biotic 

(including viruses, bacteria, fungi, insects and nematodes) 

stress is stress as a result of the impact of a '-"^ o^~ J,,, , 

. stresses caused by patho^ns such asthestress 

stress caused by an o,gan.m, ^^^^^^^^ ^^^^ P'-* *-otertstlcs acceding to 
caused by a symbiotic or an epiphyte. Accoramgiy, v . , ■ 

tt,e present invention encompass early vigour, survival rate, stress tolerance. 
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Field-grown plants almost always experience some form of stress, albeit mild, and therefore 
the temis "growth", "yield" "biomass production" or "biomass" do not distinguish the 
perfomiance of plants under non-stressed from performance under stress conditions 
Advantageously, the effects of the invention on growth and yield are expected to occur under 
both severe and mild stress conditions (i.e. under stressed and non-stressed conditions). 

Characteristics related to "plant physiology" encompass characteristics of functional processes 
of a plant, including developmental processes such as growth, expansion and differentiation 
sexual development, sexual reproduction, seed set. seed aevelopment. grain filling, asexual 
reproduction, cell division, dormancy, gemnination, light adaptation, photosynthesis, leaf 
expansion, fiber production, secondary growth or wood production, amongst others; responses 
of a plant to externally-applied factors such as metals, chemicals, homiones. growth factors 
environment and environmental stress factors (e.g. anoxia, hypoxia, high temperature, low 
temperature, dehydration, light, day length, flooding, salt, heavy metals, amongst others) 
including adaptive responses of plants to said externally-applied factors. Particular plant 
physiology characteristics which are altered according to the methods of the present invention 
encompass altered storage lipid mobilization, photosynthesis, transcription regulation and 
signal transduction. 

Characteristics related to "plant biochemistry" are to be understood by those skilled in the art 
to refer to the metabolic characteristics. "Metabolism" as used in the present invention is 
interchangeable with biochemistry. Metabolism and/or biochemistry encompass catalytic or 
assimilation or other metabolic processes of a plant, including primary and secondary 
metabolism and the products thereof, including any element, small molecules, macromolecules 
or chemical compounds, such as but not limited to starches, sugars, proteins, peptides, 
enzymes, hormones, growth factors, nucleic acid molecules, celluloses, hemicelluloses. 
calloses. lectins, fibres, pigments such as anthocyanins, vitamins, minerals, micronutrients, or 
macronutrients, that are produced by plants. Preferably, the methods of the present invention 
are used to change the nitrogen or carbon metabolism. 

As shown In Tables 1 and 2, several of the E2Fa-DPa target genes identified have an E2F 
recognition sequence in their promoter and most of these genes are involved in DNA 
: replication. Therefore, provided by a particular embodiment of the present invention is a 
method as described above to influence DNA synthesis and DNA replication. The secondary 
induced genes, which are the genes not having the E2F target consensus sequence in their 
promoter region, encode proteins involved in cell wall biosynthesis, transcription, signal 
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. J^ZTle an unKnown ™. «ly, a ,a,«e number o, — genes 
!:;ld:f,ed as we,,, .a,n,y genes invCved ,n n«rate assignation or mauboi.. and carbon 



metabolism. 



The put.«ve direct E2Fa.DPa target genes as identmed by the presence of an E2F-DP-^nd,ng 
IL rily belong to the group of genes involved in DNA syn«,esis, whe^as the sec^nda^ 
Tlc^Zr^ are mainiy linked to nitrogen assimilation and carbohydrate metabo sn^. 
Tr" re T s eLdated by «,e present invention that enhanced levels of E2Fa-DPa ,n plants 
; vTXact on expression levels of genes involved in nitrogen ^'^^^^l^J^^ 
metabolism The experimental data suggest that in E2Fa/DPa overexpress.ng P'-'= *^;«^ ^ 
n r n.;ogen toL nucleotide synthesis pathway causing a decreased ^^^^^'^^^ 
nZgln compounds such as amino acids and storage proteins. Correspond.ng to these 
Igstre— found that the level Of endo.duplica«on of E2Pa.DPat.^^^^^^^^^^^ 

depends on the amount of nitrogen available in the medium. Also, «,ese data -99-' * 
, g rl arrest observed upon E2Fa/DPa expression resulU at leas, from a n*ogen drain o me 
nuceotide synthesis pathway, causing a decreased synthesis of other nKrogen components, 
such as amino acids and storage components. 

AS purine and pyrimidine bases are nitrogen-rich, the induction of nitrogen assimilation genes 
„ ^ t^rE2Pa dp' transgenic plants Is a mechanism to supply enough n«roge„ for nudeo.de 
" IZ^I. Most Jy this drain of nHrogen from essential biosynthet^c ^«,^s . * 

nucleo«de biosynthesis pathway has «s effects on many aspects of plan. '^^ ^Z^ 

he seen from .he reduCion of expression of vegetative storage protein ge.es and genes 

involved in amino-acid biosynthesis. 

' Therefore a particular aspect of the inven«on is .he use of genes involved -bo^; 

n«rogen meUbolism or al,oca.ion, for al.ering nitrogen and carbon metabCsm an* or t e 
he balance between carbon and n^rogen orto reallocate carbon and,or n*ogen or to the 
composition of components containing carbon and nitrogen. The elucida^on C gene that a e 

,0 pre to Shift the nitrogen ass,mila.ion from one ..ologica, process .0 anCher b^og-' P- 
is importan. for many app,ica«ons. These genes can for example be used ^^J^^^Z 
composifion o, n,.rogen-con.aining compounds In a cell, such as "'-^-^^^pe 
molecules, amino acid, nucleic acid, chlorophyll or any Cher me.aboMes. Ateow^^^*e ^cope 
. of the present inventions are these altered components obtainable by the methods of the 

35 present invention, with altered balance between carbon and nitn^gen. 
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Therefore, according to the present invention, there is provided a method as described above, 
wherein said altered metabolism comprises altered nitrogen and/or carbon metabolism. 



In a particular embodiment, said carbon metabolism comprises the processes of carbon 
5 fixation, photosynthesis and photorespiration. In another embodiment, said nitrogen 
metabolism comprises nitrogen fixation or the reallocation of nitrogen residues from the pool of 
amino acids into the pool of nucleic acids or vice versa. 

Microarray analysis of E2Fa-DPa overexpressing lines, as herein described, identified a cross- 
10 talking matrix between DNA replication, nitrogen assimilation and photosynthesis. It has been 
described previously that there is a link between carbon: nitrogen availability and growth, 
storage lipid mobilization and photosynthesis (Martin T. (2002)). Therefore according to the 
present invention there is provided, a method as described above, wherein said altered plant 
characteristic comprises altered storage lipid mobilization and/or photosynthesis. 

15 

The microarray studies elucidated for the first time particular genes that are upregulated and 
particular genes that are downregulated in a plant cell overexpressing E2Fa/DPa, many of 
which were of unknown function. It is now disclosed how to use these genes and/or proteins 
for altering plant characteristics. 

20 

According to a preferred embodiment, recombinant means are used to alter plant 
characteristics. More preferably, one or more of the genes essentially similar to any of SEQ ID 
NO 1 to 2755 is introduced into a plant as a transgene. Accordingly, the present invention 
provides a recombinant nucleic acid comprising: 
25 (a) one or more nucleic acid sequences essentially similar to any one of SEQ ID 

NO 1 to 2755; optionally operably linked to 

(b) a regulatory sequence; and optionally operably linked to 

(c) a transcription termination sequence. 

This recombinant nucleic acid is suitable for altering plant growth characteristics. 

30 

Constructs useful in the methods according to the present invention may be constructed using 
recombinant DNA technology well known to persons skilled in the art. The gene constructs 
may be inserted into vectors, which may be commercially available, suitable for transforming 
- into plants and suitable for expression of the gene of interest in the transformed cells. 

35 
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WO 2004/035798 . sequence is 

eukaryotic host cells. 

^ „e.o..acco*. . .e p..e. — „ - - P.*e. .V — .Jn.oa^ 
p.n. at least a pa. of a (natura, o. an..a o r„ ^^^^ , 
Chromosome (BAC)), which chromosome contains at least ^J^^ ^ 

part of a chromosome comprising at leasi a ycn'^ 
similar to any one of SEQ ID NO 1 to 2755. 

,n a particular em^oaiment ot the present — n -'^^^.Ta — t 
expressible promoter. In a further embodiment of the "^^^^'^^^'^2.^, ,„ 
promoter, su* as the G0S2 promoter, the ubiqu*n promoter, ^^^ ^ 
embClmen. of the Invention the promoter is a promofar ^'^-^"^^^J^^ 
ceus, such as, but not llm-rted to the cdc2 ^^J^J^^ enhancer, or a 

Mematlvel,, the regulatory element as mentioned above ^"^^J^^^"^ ,^ p,^,„, 
transcnptlonal enhancer that Is used to enhance expression of a gene according 



invention. 
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The term ..e^ulatorv sequence" .fers to control ONA -'^-^^^^^Z -^TJl 

affect expression of coding sequences to which they are operably ^"'^^^ 

control sequences differs depending *^ -^l " aT— • - 

sequences generally include promoters, ribosomal binding ^^'-Jl^ 

euKaryoUs generally control sequences in.ude promoters, <— 

Silencers. The term "control sequence" is intended to 

,h, nmsence of which are necessary for expression, and may also inciu 
:::^s components and .Ich determines ^^^.^^ e^cir 
.expressed. Reference -^2: ^ -^^^^^^ 

includes the transcnptlonal regulatory sequence initiation with or 

, gene, including «,e TATA box which Is required for accurate f " Motivating 

Without a CCAAT box sequence and ad«ional regulatory ^.^^^^^ 
sequences, enhancers and silencers, which alter gene ^^'^^ZZZ^^^^ 
developmental and/or external stimuli, or in a tissue-specrfic manner. The tem, prom 
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includes the transcriptional regulatory sequences of a classical prokaryotic gene, in which case 
It may include a -35 box sequence and/or a -10 box transcriptional regulatory sequences. 

The tenn "pronnoter" is also used to describe a synthetic or fusion molecule or derivative, which 
confers; activates or enhances expression of a nucleic acid molecule in a cell, tissue or organ 
"Promoter" is a DNA sequence generally described as the 5'-region of a gene, located proximal 
to the start codon. The transcription of an adjacent DNA segment is initiated at the promoter 
region. In the context of the present invention, the promoter preferably is a plant-expressible 
promoter sequence. Promoters, however, that also function or solely function in non-plant cells 
such as bacteria, yeast cells, insect cells and animal cells are not excluded from the invention 
By "plant-expressible" is meant that the promoter sequence, including any additional regulatory 
elements added thereto or contained therein, is at least capable of inducing, conferring, 
activating or enhancing expression in a plant cell, tissue or organ, preferably a 
monocotyledonous or dicotyledonous plant cell, tissue, or organ. 

Preferably, the nucleic acid sequence capable of modulating expression of a gene encoding an 
E2F target protein is operably linked to a constitutive promoter or a tissue specific promoter 
The temi "constitutive" as defined herein refers to a promoter that is active predominantly in at 
least one tissue or organ and predominantly at any life stage of the plant. Preferably the 
promoter is active predominantly but not exclusively throughout the plant 
Additionally and/or alternatively, the nucleic acid of the present invention may be operably 
linked to a tissue-specific promoter. The temi "tissue-specific" promoter as defined herein 
refers to a promoter that is active predominantly but not exclusively in at least one tissue or 
organ. 

Examples of prefen-ed promoters useful for the methods of the present invention are presented 
in Table I, II, III and IV. 



Table I: 



Exemplary constitutive promoters for use in the performance of the present invents 



GENE SOURCE 


EXPRESSION 
PATTERN 


REFERENCE 


Actin 


constitutive 


McElroy ef a/, Plant Ceil, 2: 163-171, 1990 


CAMV 35S 


constitutive 


Odell etal. Nature. 313; 810-812. 1985 


CaMV19S 


constitutive 


Nilsson etal.. Physiol. Plant. 700:456-462, 1997 


G0S2 


constitutive 


de Pater ef a/. Plant J Nov;2(6):837-44, 1992 


uoiquiiin constitutive 


Christensen etal. Plant Mol. Biol. 18: 675-689. 1992 
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rice cyclophilin 


constitutive 


Buchholz ef a/. Plant Mol Bior. z&io): a3M3, liJW 


maize H3 histone 


consiiiuitve 


Leoetitefa/ Mol Gen. Genet. 231:276-285, 1992 


actin2 


constitutive 


Anef a/, Plant J. 10(1); 107-121. 1996 



Table II: 

Exemplary seed-preferred promoters for use in the performance of the present 
invention 



GENE SOURCE 



EXPRESSION 
PATTERN 



seed-specific genes seed 



Brazil Nut albumin 



seed 



REFERENCE 

Simon, et ai. Plant Mol. Biol. 5: 191, 1985; Scorieifl, 
etai, J. Biol. Chem. 262: 12202. 1987.; Baszczynsid, 

etal.. Plant Mol. Biol. 14: 63 3, 1990. 

Pearson, etal.. Plant Mol. Biol. 18: 235-245, laaz, 



legumin 
glutelin (rice) 

zein 



seed 



Ellis, etal., Plant Mol. Biol. 10: 2U3-zi4. 1988. 
Takaiwa, et a/.. Mol. Gen. Genet. 208: 15-22, 1986; 
Takaiwa. etal., FEBS L etts. 221: 43-47. 1987. 
Matzke etal Plant Mol Biol, 14(3):323-32 1990 



napA 



seed 



wheat LMW and HMW | endospenn 
glutenin-1 



Staiberg, etal. Planta 199: 515-5 19, 1996. ^ 

Mol Gen Genet 216:81-90, 1989; MAR 17:461-2, 
1989 



wheat SPA 



seed 



Albaniefa/, Plant Cell, 9: 171-184. 1997 



wheat a, b and g- j endosperm 
gliadins 



EMBO 3:1409-15. 1984 



barley ttr1 promoter 
"bafiey bT, C, D, 
hordein 
barley DOF 



endosperm 
endosperm 

endosperm 



blz2 



endosperm 



synthetic promoter 



endosperm 



rice prolamin NRP33 endosperm 



Theor AppI Gen 98:1253-62, 1999; Plant J. 4:343-55, 

1993; Mol Gen Genet 250:750 -60,1996 

Mena et at, The Plant Journal. 116(1): 53-62. 1998" 



EP991 06056.7 

Vicente-Carbajosa etal.. Plant J. 13: 629-640, lyya. 



Wu etal. Plant Cell Physiology 39(8 ) 885-889, 1998 

1998 



rice -globulin Glb-1 
rice 0SH1 



endosperm 
embryo 



rice alpha-globulin | endosperm 
REB/OHP-1 



rice ADP-glucose PP endosperm 



Wu et al. Plant Cell Physiology 39(8) 885-889, 
Sato ef a/. Proc. Natl. Acad. Sci. USA, 93: 811 7-8122. 

1996 

Nakase etal. Plant Mol. Biol. 33: 513-522, laa^ 

Trans Res 6:157-68. 1997 ~ 



maize ESR gene | endosperm 
family 

sorgum gamma-kafirin j endosperm 



Plant J 12:235-46. 1997 

PMB 32:1029-35, 1996 
22 
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KNOX 


embryo 


Postma-Haarsma et al, Plant Mol. Biol. 39:257-71, 
1999 


rice oleosin 


embryo and 
aleuron 


Wu etat. J. Biochem., 123:386, 1998 


sunflov/er oleosin 


seed (embryo 
and dry seed) 


Cummins, ef a/., Plant Mol. Biol. 19; 873-876, 1992 



Table 111: 

Exemplary flower-specific promoters for use in the performance of the invention 



Gene source 


Expression pattern 


REFERENCE 


AtPRP4 


flowers 


http://sa lus. medium.ed u/m mg/tierney/html 


chalene synthase (chsA) 


flowers 


Van der Meer, ef al., Plant Mol. Biol. 15, 95-109, 
1990. 


LAT52 


anther 


Twell etal Mol. Gen Genet. 217:240-245 (1989) 


1 apetala-3 


flowers 
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PRO0117 


putative 40S ribosomal protein 


weak in endosperm 


PRO0122 


chlorophyll a/b-binding protein precursor (Cab27) 


very weak in shoot 


PRO0123 


putative protochlorophyllide reductase 


strong leaves 


PRO0126 


nfietallothionein RiCMT 


jstronq discrimination centpr / 
shoot meristem 


PRO0129 


G0S2 


[strong constitutive 


PRO0131 


GOS9 




PRO0133 


chitinase Cht-3 


Ivery weak meristem specific 


PRO0135 


alpha-globuiin 


[strong in endosperm 


PRO0136 


alanine aminotransferase 


Iweak in endosperm 


PRO0138 


cydin A2 




PRO0139 


Cyclin D2 




PRO0140 


Cyclin D3 




PRO0141 


cyclophyllin 2 


Ishoot and seed 


PRO0146 


sucrose synthase SSI (barley) 


1 medium constitutive 


PRO0147 


trypsin inhibitor iTR1 (barley) 


[weak in endosperm 


PRO0149 


ubiquitine 2 with intron 


strona con^titi itivp 


PRO0151 


WSI18 


embrvo + strp^^ 


PRO0156 


HVA22 homologue (putative) 




PRO0157 


EL2 




PRO0169 


aquaponne 


mfidium nnn^titiitivp in \/r\iinn 

plants 


PRO0170 


High mobility group protein 


strong constitutive 


PRO0171 


reversibly glycosylated protein RGP1 


weak constitutive 


PRO0173 


cytosolic MDH 


shoot 


PRO0175 


RAB21 


embryo + stress 


PRO0176 


CDPK7 




PRO0177 


Cdc2-1 


verv weak in meristem 

V Wl y vTWMi\ 111 II iwi 1 w Lw 1 1 1 


PRO0197 


sucrose synthase 3 




PRO0198 


DsVP1 




PRO0200 ( 


DSH1 L 

1 


Tieristem 


PRO0208 


mutative chlorophyllase 




PRO0210 ( 


DsNRTI 




PRO0211 


EXP3 




PRO0216 F 


)hosphate transporter OjPT1 




PRO0218 c 


)leosin 18kd L 


jleurone + embryo 


PRO0219 I 


biquitine 2 without intron 




PRO0220 F 


?FL 
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PRO0221 
PRO0223 



DRO0224 
"PRO0225 



fri^nt of prolami n K^-o promoter 
4XABRE 



PRO0226 




glutelin OSGLUAci 



BLZ-2_short (barley) 
BLZ-2jong (bariey; 
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PRO0227 
iPRO0228 

■1 a plan.. The ter. ne.n,ina.or encompasses a con.ro, ^^^^ l^J^^ , 

« and .erminaUon of .ranscnp.,on. Aad...ona re u ^ ^ ^^^^^ ^ 
..„s*«ona, as well as WaUona, enhancers, ^hose s^l^d . 

,em,^a,or and enhancer sequences, which .ay ^ 3^„ea ,n 
invention. Such se<,uences would be known or may readily be ob.a,ned by a p 

the art. 

.e .ne.lo cons.r.c.s o. .he ln.*n may ^fsp^* ^^^^^^^^ 

Which IS required for maintenance and/or repl,ca.,on ,n a spe*^ - JP 

=:r«rr :=r:,"=:'"-^^ — - - - 

limited to, tlie f 1 -ori and colE1 . 

.enetlc con.ru. may option* oomphse a seiectab. ma.er ^^^^^ 
.em, "seiecable marker gene" Includes any ^^"^^ ."^""^'^'^'^TLs which are 
.^ich n IS expressed to fa«e the —a .on marKers may 

wnsfoced or Uansfom,ed with a nudeic acid ™ 3 new 

he selected from markers that confer antibiCc or herb,c,de '^^ 
„e.ahor,c .ran or ma. allow visual selecUon. Examples o, selecUb. n.a^e g 
genes conferring resisUnce .0 an.lblo«cs (such ^ JJ^ ,p. en 

i::::^^^^^^^^^ ---- - 

,,.hosate,, or az:^r:::::::^ :^ ^^^^ 

mannose as sole carbon source), vibu „\ „r fluorescence (Green 

e,am,.e beta^lucuronidase, GUS), luminescence (such as ^'^^J^^^^l^^ J^3,„ 
Buolen. Protein. GFP, and de— thereof). J J^^e gene acr), 

marker genes include *e ampiciliin resistanc^ (Ampr), tetracyCne res.s.anc 

26 
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bacterial kanamycin resistance gene (Kanr), phosphinothricln resistance gene, and the 
chloramphenicol acetyltransferase (CAT) gene, amongst others 

The methods of the present invention are particularly relevant for applications in agriculture 
and horticulture, and serve to develop plants that have altered characteristics. 

Accordingly, another embodiment of the invention is a method for making a transgenic plant 
comprising the introduction of a recombinant nucleic acid as mentioned above into a plant. "A 
plant" a? used herein means plant cell, plant part etc. as defined herein below. 
According to a preferred embodiment this method for the production of a transgenic plant 
further comprises the step of cultivating the plant cell under conditions promoting regeneration 
and mature plant growth. 
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A further embodiment relates to a method as described above, comprising stably integrating 
into the genome of a plant a recombinant nucleic acid as mentioned above. Alternatively the 
recombinant nucleic acids comprising the nucleic acids of the present invention are transiently 
introduced Into a plant or plant cell. The protein itself and/or the nucleic acid itself may be 
introduced directly into a plant cell or into the plant itself (including introduction into a tissue 
organ or any other part of the plant). According to a preferred feature of the present invention' 
the nucleic acid is preferably introduced into a plant by transformation. 

The term "transformation" as referred to herein encompasses the transfer of an exogenous 
polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue 
capable of subsequent clonal propagation, whether by organogenesis or embryogenesis may 
be transformed with a genetic construct of the present invention and a whole plant regenerated 
therefrom. The particular tissue chosen will vary depending on the clonal propagation systems 
available for, and best suited to. the particular species being transformed. Exemplary tissue 
targets include leaf disks, pollen, embryos, cotyledons, hypocotyls. megagametophytes, callus 
tissue, existing meristematic tissue (e.g.. apical meristem, axillary buds, and root meristems) 
and induced meristem tissue (e.g. cotyledon meristem and hypocotyl meristem) The 
polynucleotide may be transiently or stably introduced into a host cell and may be maintained 
non-integrated, for example, as a plasmid. Alternatively and preferably, the transgene may be 
stably integrated into the host genome. The resulting transfomied plant cell can then be used 
■■ to regenerate a transformed plant in a manner known to persons skilled in the art. 

Transformation of a plant species is now a fairiy routine technique. Advantageously any of 
several transfomiation methods may be used to introduce the gene of interest into a suitable 
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ancestor cell. Transformation methods include the use of liposomes, electroporation. 
chemicals that increase free DNA uptake, injection of the DNA directly into the plant, particle 
gun bombardment, transformation using viruses or pollen and microprojection. Methods may 
be selected from the calcium/polyethylene glycol method for protoplasts (Krens et al., 1982; 
5 Negrutiu et sL, 1987); electroporation of protoplasts (Shillito et al, 1985); microinjection into 
plant material (Crossway et a/., 1986); DNA or RNA-coated particle bombardment (Klein etai. 
1987) infection with (non-integrative) viruses and the like. 

Transgenic rice plants expressing a gene according to the present invention are preferably 
10 produced via Agrodacferium-mediated transfomiation using any of the well known methods for 
rice transformation, such as described in any of the following: published European patent 
application EP 1198985 A1, Aldemita and Hodges (1996); Chan et ai (1993), Hiei et al. 
(1994), which disclosures are incorporated by reference herein as if fully set forth. In the case 
of corn transformation, the preferred method is as described in either Ishida et al. (1996) or 
15 Frame et al. (2002). which disclosures are incorporated by reference herein as if fully set forth. 

Generally after transformation, plant cells or cell groupings are selected for the presence of 
one or more markers which are encoded by plant-expressible genes co-transferred with the 
gene of interest, following which the transformed material is regenerated into a whole plant. 

20 

Following DNA transfer and regeneration, putatively transformed plants may be evaluated, for 
instance using Southern analysis, for the presence of the gene of interest, copy number and/or 
genomic organisation. Alternatively or additionally, expression levels of the newly introduced 
DNA may be monitored using Northern and/or Western analysis, both techniques being well 
25 known to persons having ordinary skill in the art. 

The generated transformed plants may be propagated by a variety of means, such as by clonal 
propagation or classical breeding techniques. For example, a first generation (or T1) 
transfonned plant may be selfed to give homozygous second generation (or T2) transfomiants. 
30 and the T2 plants further propagated through classical breeding techniques. 

The generated transformed organisms may take a variety of forms. For example, they may be 
chimeras of transformed cells and non-transformed cells; clonal transformants (e.g., all cells 
transformed to contain the expression cassette); grafts of transformed and untransformed 
35 tissues (e.g., in plants, a transfonned rootstock grafted to an untransformed scion). 
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The present invention also encompasses plants obtainable by the methods according to the 
present Invention. The present invention provides plants having one or more altered 
characteristics, when compared to the wild-type plants, characterised in that the plant has 
modified expression of one or more nucleic acids and/or modified level and/or activity of a 
5 protein, wherein said nucleic acid and/or protein are essentially similar to any one of SEQ ID 
NO 1 to 2755. 

In one embodiment of the present invention, such a plant is a transgenic plant. According to a 
further embodiment such transgenic plant comprises an isolated nucleic acid and/or protein 
sequence essentially similar to any one for Seq Id NO 1 to 2755. 
10 Alternatively, according to one embodiment of the present invention, such a plant having one 
or more altered plant characteristics and having modified expression of one or more nucleic 
acids and/or modified level and/or activity of a protein, wherein said nucleic acid and/or protein 
are essentially similar to any one of SEQ ID NO 1 to 2755., is created by breeding techniques. 

1 5 The present invention clearly extends to any plant cell or plant produced by any of the methods 
described herein, and to all plant parts and propagules thereof. The present invention extends 
further to encompass the progeny of a primary transformed or transfected cell, tissue, organ or 
whole plant that has been produced by any of the aforementioned methods, the only 
requirement being that progeny exhibit the same genotypic and/or phenotypic characteristic(s) 

20 as those produced in the parent by the methods according to the invention. The Invention 
accordingly also includes host cells containing an isolated nucleic acid molecule encoding a 
protein essentially similar to any one of SEQ ID NO 1 to 2755. Such host cell may be selected 
from plants, bacteria, animals, algae, fungi, yeast or insects. Prefenred host cells according to 
the invention are plant cells. The invention also extends to harvestable parts of a plant such as 

25 but not limited to seeds, leaves, fruits, flowers, stem cultures, stem, rhizomes, roots, tubers 
and bulbs. 

The term "plant" as used herein encompasses whole plants, ancestors and progeny of the 
plants and plant parts. Including seeds, shoots, stems, roots (Including tubers), and plant cells, 

30 tissues and organs. The term "plant" also therefore encompasses suspension cultures, 
embryos, meristematic regions, callus tissue, leaves, gametophytes, sporophytes, pollen, and 
microspores. Plants that are particularly useful in the methods of the invention include all 
plants which belong to the superfamily Viridiplantae, in particular monocotyledonous and 
■ dicotyledonous plants including a fodder or forage legume, ornamental plant, food crop, tree, 

35 or shrub selected from the list comprising Acacia spp., Acer spp., Actinidia spp., Aesculus 
SPP; Agathis australis, Albizia amara, Alsophila tricolor, Andropogon spp., Arachis spp, Areca 
catechu, Astelia fragrans. Astragalus cicer, Baikiaea plurijuga, Betula spp., Brassica spp.. 
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Bruguiera gymnorrhiza. Burkea africana, Butea frondosa, Cadaba fannosa, CalUandra spp. 
camellia sinensis, Canna indica, Capsicum spp., Cassia spp.. Centroema . putescens, 
Chaenomeles spp., Cinnamomum cassia, Ooffea aratioa. Colophospermum mopane, 
■ Coronillia varia. Cotoneaster serotina. Crataegus spp.. Cuoumis spp.. Cuprassus spp.. 
5 Cyathea dealbata. Cydonia oblonga. Cryptomeria japonica. Cymbopogon spp.. Cynthea 
dealbata. Cydonia oblonga. Dalbergia monetaria. Davallia divarioata, Desmodium spp., 
Dioksonia squarosa, Diheteropogon ampleotens, Dioclea spp. Dolichos spp.. Do.ycn.um 
rectum: Echinochloa pyramidalis. Ehrartia spp., Eleusine coracana. Eragrestis spp.. Erythnna 
spp Eucalyptus spp.. Eudea schimperi. Eulalia viHosa, Fagopyrum spp., Feijoa seilow.ana. 
10 Fragaria spp.. Flemingia spp. Freycinetia banksii. Geranium thunbergii, Ginkgo biloba, Glyo.ne 
javanica. Gllricidia spp, Gossypium hirsutum, Grevillea spp.. Guibourtia coleosperma, 
Hedysarum spp, Hemarthia altissima. Heteropogon contortus. Hordeum vulgare, Hyparrhen.a 
rufa Hypericum erectum, Hyperthelia dissolute. Indigo, incamata. Iris spp., Leptarrhena 
pyrolifolia. Lespediza spp., Lettuca spp.. Leucaena leucocephala. Loudetia simplex. Lotonus 
15 bainesii, Lotus spp., Macrotyloma axlllare. Ma/us spp., Manihot esoulenta. Medicago sat,va, 
Metasequoia glyptostroboides. Musa sapientum. Nicotianum spp.. Onobrych.s spp.. 
Ornithopus spp., Oryza spp., Peltophorum africanum, Pannisetum spp.. Persea grat,ss,ma. 
Petunia spp.. Phaseolus spp.. Phoenix canariensis. Phormium cookianum. Phot.n.a spp.. 
Picea glauca, Pinus spp., Pisum sativum. Podocarpus totara. Pogonarthria fleck,,. 
20 Pogonarthria squamosa, Populus spp.. Prosopis cineraria. Pseudotsuga menz.es.,. 
Pterolobium stellatum. Pyrus communis. Quercus spp.. Rhaphiolepsis umbellate. 
Rhopalostylis sapida. Rhus natalensis, Ribes grossularia. Ribes spp.. Robinia pseudoaoao.a. 
Rosa spp.. Rubus spp.. Salix spp.. Schyzachyrium sanguineum. Sciadopnys vert.c.llata. 
Sequoia sempervirens. Sequoiadendron giganteum. Sorghum bicolor. Sp.naca spp., 
25 Spombolus fimbriatus, Stiburus alopecuroides. Stylosanthos humilis. Tadehagi spp. Taxod.um 
distichum. Themeda triandra. Trifolium spp.. Triticum spp.. Tsuga heterophylla, Vaccn.um 
spp Vicia spp., Vitis vinifera. Watsonia pyramidata, Zantedeschia aethiopica. Zea mays, 
amaranth, artichoke, asparagus, broccoli. Brussels sprouts, cabbage, canola. carrot, 
cauliflower, celery, collard greens, flax. kale, lentil, oilseed rape. okra. onion, potato, nee. 
30 soybean, straw, sugar beet, sugar cane, sunflower, tomato, squash tea. trees. Alternatively 
algae and other non-Viridlplantae can be used for the methods of the present invention. 
Preferably the plant according to the present invention is a crop plant selected from nee. 
maize, wheat, barley, soybean, sunflower, canola. sugarcane, alfalfa, millet, legum.nosae 
■ (bean, pea), flax . lupinus. rapeseed. tobacco, popular and cotton. Further preferably, the plant 
35 according to the present invention is a monocotyledonous plant, most preferably a cereal. 
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The term 'gene(s)' or 'nucleic acid', 'nucleotide sequence', as used herein refers to a polymeric 
form of a deoxyribonucleotides or ribonucleotide polymer of any length, either double- or 
single-stranded, or analogs thereof, that have the essential characteristics of a natural 
ribonucleotide in that they can hybridize to nucleic acids in a manner similar to naturally 
occurring polynucleotides. A great variety of modifications have been made to DNA and RNA 
that serve many useful purposes known to those skilled in the art. For example, methylation, 
'caps' and substitution of one or more of the naturally occurring nucleotides with an analog. 
Said terms also include peptide nucleic acids. The term "polynucleotide" as used herein 
includes such chemically, enzynrjatyically or metabolically modified forms of polynucleotides. 

With "recombinant nucleic acid" is nDeant a nucleic acid produced by joining pieces of DNA 
from different sources through deliberate human manipulation. 

The inventors identified genes that are upregulated in plants overexpressing E2Fa/DPa. These 
genes can be used to simulate E2Fa/DPa related effect in a plant. 

Therefore, according to the invention, there is provided a method to alter characteristics of a 
plant, comprising overexpression of one or more nucleic acids essentially similar to any one of 
SEQ ID NO 1 to 2755, or wherein the method comprises enhancing the level and/or activity of 
one or more proteins essentially similar to a protein sequence essentially similar to any one of 
SEQ ID NO 1 to 2755. 



Also identified were genes that are downregulated in plants overexpressing E2Fa/DPa. These 
genes can be used to simulate E2Fa/DPa related effect in a plant. Therefore, according to the 
invention, there is provided a method to alter plant growth characteristics, comprising 
downregulation of expression of one or more nucleic acids essentially similar to any one of 
SEQ ID N0 1 to 2755, or wherein the method comprises decreasing level and/or activity of one 
or more proteins essentially similar to any one of SEQ ID NO 1 to 2755. 



Genetic constructs aimed at silencing gene expression may comprise the nucleotide sequence 
essentially similar to any one of SEQ ID NO 1 to 2755 or one at least a portion thereof in a 
sense and/or antisense orientation relative to the promoter sequence. Preferably the portions 
comprises at least 21 contiguous nucleic acid of a sequence to be downregulated. Also, sense 
or antisense copies of at least part of the endogenous gene in the form of direct or inverted 
repeats may be utilized in the methods according to the invention. The characteristics of plants 
may also be changed by introducing into a plant at least part of an antisense version of the 
nucleotide sequence essentially similar to any one or more of SEQ ID NO 1 to 2755. It should 
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be de.r tha, par. of .he nudelc ack. (a portion) could also achieve the desTed result. 
Homologous anti-sense genes are preferred, homologous genes l>eing plant genes, preferably 
plant genes from the same plant species in which the silencing construct is introduced. 

Detailed analysis of the promoters of the genes identified in the present invertfon a»e 
identification of novel E2Fa/DPa targe, genes that are under the direct control °' E2Fa/DPa 
and that are mainly involved in DNA replication. For all the genes identrfled ,n the present 
invention, roference Is made to the MIPS database MATDB accession number. Th,s un«,ue 
idenfflcation number refers to the deposit of infonnation related to the gene in ,uest,on e^g^ 
the unspliced sequence, the spliced sequence, *e protein sequence, the domains of the 
protein etc. An example of the infonnation deposKed under the accession ""'^^' "'"'f^' 
I Shown in Rgure 4. Based on this unique accession number, a person skilled ,n the art would 
be aWe to locate the gene provided by the present Invention in Its genomic environment. From 
this infom^ation one can Identify and isolate the upstream control elemenU of mese genes^ 
Especially interesting are the promoters of these genes as control elements for dnving or 
regulating .ranscrip.ion of heterologous genes. Therefore, according to the invention « 
provided an isolated nucleic acid comprising one or more of the regulatory -^"^-^^ ^^^ 
of the startcodon of the nucleic acids essentially similar to any one of SEQ ID NO 1 to 2756^ 
Furthemnore, the invention provides an isolated nucleic acid as mentioned above, ^'^'"^^ 
ragulatory element is the promoter of said the genes essentially similar to any one of the 
sequence presented in SEQ ID NO 1 to 2755. 

Further .he InvenBon also relates to the use of a nucleic acid sequence or protein essentially 
similar to any one of SEQ ID NO 1 to 2755, for altering plant charactenstios. 

Another method for altering plan, characteristics and/or growth characteristics of a plant 
reskles in .he use of allelic variants of the genes of the present invention. Allelic variants ex,st 
m nature and encompassed within the methods of the present invenUon is me use of .hese 
natural alleles. Alternatively, in particular breeding programs, such as for example marker 
assisted breeding, or conventional brooding programmes, i. is sometimes practical to introduce 
allelic variation in the plants by mutagenic treaUnen. of a plan.. One sui.able mu.agenic 
method is EMS mutagenesis. Identification of allelic variants then takes place by, for exarnple, 
PCR. This is followed by a selection step for selection of superior allelic vanante of the 
sequence in question and which give rise to altered growth 

typically carried out by monitoring growth performance of plants containing different allelic 
vahants of me sequence in question Monitoring growth perfom,ance can j"-"^ ^ 
greenhouse or in the field. Further optional steps include crossing plan.s in which .he supenor 
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allelic variant was identified with another plant. This could be used, for example, to make a 
combination of interesting phenotypic features. 

According to another aspect of the present invention, advantage may be taken of the nucleic 
acid sequence of the present invention in breeding programs. In such a program, a DNA 
marker may be identified which is genetically linked to the nucleic acid of the present invention. 
This DNA marker is then used in breeding programs to select plants having altered growth 
characteristics. Therefore, the present invention also encompass the use of a nucleic acid 
sequenpe essentially similar to any one of SEQ ID NO 1 to 2755. for marker assisted breeding 
of plants with altered characteristics. 

These marker assisted breeding processes may further involve the steps of crossing plants 
and using probes or primers having part, for example having at least 10 bp, of a sequence 
corresponding to any of SEQ ID NO 1 to 2755, to detect the DNA sequence corresponding to 
15 SEQ ID NO 1 to 2755, in the progeny of the cross. 

These methods for marker assisted breeding also may involve the use of an isolated DNA 
molecule being essentially similar to SEQ ID NO 1 to 2755 or a part thereof as a marker in 
techniques like AFLP, RFLP. RAPD. or in the detection of Single Nucleotide Polymorphisms. 

20 

Further these methods for marker assisted breeding also may involve determining the 
presence or absence in a plant genome of a qualitative trait or a quantitative trait locus (QTL) 
linked to a transgene essentially similar to any one of SEQ ID NO 1 to 2755 or to an 
endogenous homologue of any one of SEQ ID NO 1 to 2755, which method comprises: 
25 (a) detecting a molecular marker linked to a QTL. wherein the molecular marker compnses 
a sequence essentially similar to SEQ ID NO 1 to 2755or an endogenous homologue thereof; 
and 

(b) detemiining the presence of said QTL as by detection of the molecular marker of step 

(a) or determining the absence of said QTL as failure to detect the molecular marker of step (a) 

30 

Alternatively, methods for marker assisted breeding may involve detecting the presence of a 
quantitative trait locus linked to a DNA sequence essentially similar to SEQ ID NO 1 to 2755 or 
to an endogenous homologue thereof in the genome of a plant. The methods described above 
may involve the steps of: 
35 (a) extracting a DNA sample of said plant; 

(b) contacting the DNA sample with a probe that hybridises to a DNA sequence according 
to claim 1 or to an endogenous homologue thereof, or to the complement thereof; 
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,0 performing a hybridisation reaction under condHU>ns suftable for hybnd,sa„on o, the 

probe to the DNA sample of (b); and 

(d) detecting the hybridisation of the probe to the DNA. 

Further the present invention also encompass the use of a nudeic acid sequence essentiaUy 
simtr to any one of SEQ NO 1 to 2756, for conven«ona, breeding of plants wrth a.e.d 

r— i breeding programs, the nuCeic acid essen«y similar to anyone of SHOIO 
NO 1 1, 2755 is used to select plants with better plant characterisfcs -mpa.Bd to the n ™a 
.ild-type Plants, .e plants w«h better growth 

variation in the alleles of the gene corresponding to any one of SEQ ID NU 

onginated from manmade variation in these genes, for example vanat,on created by BUS 

mutagenesis or other methods to created single nucleotide polymorfisms. 

Further the invent^n also relates to the use of a nudeic add or a proteir, essentially similar 
any one of SEQ ID NO 1 to 2755, as a growth regulator. 

,n a par^cular embodiment sud, a growth regulator is a herbicide or is a growth 

presL invention therefore also provides a gro^h '^'^^ -;^'>;^-^-Z 7^ 
acid and/or a protein essentially similar to any one of SEQ ID NO to z od 9 
etula«ng compositions according to the present invention can additionally compnse any 
Xe sually present in growth regulating composnions sud, as growth inh.b.0,.. he* «<^s 
owth stimulators. Also a Kit comprising a sequence essen«a,ly sWIar to any SEQ 
■ ID NO 1 to 2755 (for example in the torn, of a herbicide) is in the scope of *e p esent 
i venuon. «so any other plant effective agent comprising the -^---^^Ir Va" 
present InvenUon are provided herein. Methods to produce the -""f 
gents as mentioned above are also provided by the present ,nvent,on 
Reduction of any one or more of the sequences essentially similar to any one of SEQ ID NO 1 
to 2755. Such sequences and methods are herein provided. 

' Further, plants of the present invention have improved characteristics, such as improved 
growth and yield, whidi makes these plant suitable to produce industrial proteins. 

Accordingly, the present inven«on provides a method for the ^^^^^^^^^ 
5 pham^aceuticals, which method compnses mod^ng expression of ^ "'^'^^'^ 

modifying level and/or activity of a protein, said nudeic add an*or protein being essential y 
similar to any one of SEQ ID NO 1 to 2755 
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The present invention therefore also enconnpasses the use of plants as described above, for 
the production of (industrial) enzymes and/or pharmaceuticals. The (Industrial) enzymes and 
pharmaceuticals produced according to the method as described above are also encompasses 
by the present invention. 

Also the invention as presented herein offers means to alter the characteristics not only of 
plants, but also of other organisms, such as mammals. The plant genes of the present 
invention, or their homologues, or the plant proteins or their homologues, can be used as 
therapeutics or can be used to develop therapeutics for both humans and animals. 
Accordingly, the present Invention relates to a nucleic acid or a protein essentially similar to 
any one of SEQ ID NO 1 to 2755 , for use as a therapeutic agent. 

m a particular embodiment, the use as a therapeutic agents encompasses the use in gene 
therapy, or the use to manufacture medicaments such as for example therapeutic protein 
samples. Also the nucleic acids and/or proteins according to the present invention can be 
applied In diagnostic methods. 

Accordingly provided by the present invention is the use of a nucleic acid or a protein 
essentially similar to any one of SEQ ID NO 1 to 2755. for use as a therapeutic agent, a 
diagnostic means, a kit or plant effective agent. 

Further encompassed by the invention are therapeutic or diagnostic compositions or kits or 
plant effective agent, comprising a nucleic acid and/or a protein essentially similar to any one 
of SEQ ID NO 1 to 2755. These compositions may comprise other additives usually applied for 
therapeutic compositions. Methods to produce these therapeutic or diagnostic compositions or 
kts are also provided by the present invention and involve the production of any of the 
sequences essentially similar to any one of SEQ ID N0 1 to 2755. 

The plants according to the present invention have altered characteristics, such as for example 
improved growth and yield. v«/hich makes them suitable sources for many agricultural 
applications and the food industry. Accordingly, provided by the present invention there is a 
food product derived from a plant or host cell as described above and also the use of such a 
food product in animal feed or food. 

In molecular biology it is standard practice to select upon transfection or transformation those 
individuals (or groups of individuals, such as bacterial or yeast colonies or phage plaques or 
eukaryotic cell clones) that are effectively transfected or transformed with the desired genetic 
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oonstruc Typically these selection procedures are based on the p-sence of a sele^able or 
rdable la J in the transfeCed genetic construct, to distinguish the P-^-'--— 
easily fron, the negative individuals. The nucleic adds and proteins according to the present 
n"ln are capable of altering .he charactedstics of the host cells to which they are ap^,e^ 
Lefore, the nucleic acids and/or proteins according to the present invention can a so be 
used as se.Ca«e marKe,., screenable ™H<ers or selection ^^-'^ '° 
particular en„»dln,ent, the present InvenHon provides the use of a nucie,c acd or a p^ 
essentially similar to any one of SEQ ID NO 1 to 2755 as a posft^e or negative selectable 
marker.during transfomiation of plant cell, plant tissue or plant procedures. 

DESCRIPTION OF THE FIGURES 

Flgu,« 1: volcano plot of signmoance against effect. Each x represent one of 
1 the nega«ve ,og10 of the P value fro. the gene model plotted against the difference 
between lea.-s<,uare .eans for .he genotype effect. The horizontal ^^^J^^^TeZ 
Wise «,resh* of P=0.05. The two vertical reference lines Indicate a 2^o,d cutoff for e«her 

repression or induction. 

Figure 2: Souices of alpha- ketoglutarate and other metabolites In plants, w«h annotation of 
up and do^nregulated genes In the E2Fa.DPa overproducing cells. Upregulated enzymes .e 
u deHlned * a dashed line and enzymes underlined a full line are ~ ^^^t^e 
E2Fa-DPa ve^us wild type planU. Products «,at are boxed act as precursors 
biosynthesis. A -KG, alpha-ketoglutarate; GOGAT, glutamate syn.hetese; N1A2, nrtrate 
reductase, NiR, nitrite reductase. 

Figure Z: Endoreduplicaton levels In wild type and E2Fa/DPa transgenic lines in rela«on to 
nitrogen availability. W,ld type (A) and transgenic (B) lines were grown on minima, medium ,n 
the presence of 0.1, 1, 10. or 60 oM an,.onlum nitrate. Values are means of *ree 
independent measurements. 

' Figure 4: Represents me informa«on which deposed In .he MatDB (MIPS A^Wdopste 
database) under accession number At1g57680 

. DT orR RT-PCR analysis was carried out 

: Figure 5: Verification of microarray analysis by RT-PCR. RT kuk an y 
5 under linear ampl..cation conditions. The actin 2 gene (ACT2) was used as loading control. 
GS. glutamine synthetase; GOGAT. glutamate synthase; NiR. nitrite reductase. 



36 



wo 2(»r>4/()35798 PCT/EP2003/01 1 658 

Figure 6: NMR spectrum of E2Fa/DPa overexpressing plant cells. 



Table 1: Presentation of Arabidopsis genes that are 2 fold or more upregulated in E2Fa-DPa 
overexpressing plants. Tlie genes are presented according to the functional category to which 
they belong. For some of the genes, no function has been described in the public databases and 
they are named unknown, putative or hypothetical protein. All the genes have each a unique 
MIPS accession number, which refers to the identification of the sequence in the MatDB (MIPS 
Arabidopsis tlialiana database). The MIPS accession number refers to the protein entry code for 
the MatDB of MIPS. Also, there is an accession number provided as an internal protein code. 
The fold of induction is also given for each sequence. Furthermore, where an E2F target 
sequence has been identified in the upstream region of the gene, the sequence of that site is 
also presented In the Table. Finally, other plant homologues which have substantial sequence 
identity with the Arabidopsis gene are mentioned in this Table. 

Table 2: Presentation of Arabidopsis genes that are 2 fold or more repressed in E2Fa-DPa 
overexpressing plants. Data are presented in a similar way as for Table 1, as explained above. 

Table 3: Different E2F target sequences and the frequency of their presence in the upstream 
regions of the Arabidopsis genes described in the present invention. 

Table 4: Selection of the Arabidopsis genes from the microanay that were 1.3 times 
upregulated in E2Fa/DPa overexpressing plants, compared to the wild-type plants. The gene 
name is given, as well as the MIPS database accession number and a ratio indicating the 
degree of upregulatlon of the gene. Furthermore, the E-value indicates if a significant 
homologue has been found in the public databases. 

Table 5: Selection of the Arabidopsis genes from the microan-ay that were 1.3 times repressed 
in E2Fa/DPa overexpressing plants, compared to the wild -type plants. The data are 
presented as in Table 4. The fold repression is calculated as 1 /ratio. In this Table only the 
genes that have a ratio of less than 0.77 are selected. 

Table 6: genes selected for Arabidopsis transformation 

Table 7: genes selected for rice transformation 
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EXAMPLES 

o, .on,ozygous CaMV35S.E2Fa and CaMVSSS-DPa plan.s ,De ^^^"^^ ^^^^ 
tran^formants were grown under a 16h ,igW 8h dark phCopenod at 22 C germ.na 
medium (Valvekens et al., 1988). 

Selection of transgenic lines _ 

AraWdops/s ma//ana plants were generated that ocntaned erther the KFa or me DP 

Ider the contro, of the constitutive caui^ower mosaic virus (CaMV) 35S promoter. 

r r rn:ro:r:alo^^^^^^^^ ana.sis on individual piants oon«rmed which 
piantscontained both the CaM.3.S-H2Pa and eaM.3aS.O^^^ 

:;:ra7:L:ri^^^ 

ESTs. These experimental steps are described in the following examples. 
gvamnfe 2: C' ^r,-=tn,ction ofMicmairavs 

construction of iW/croarrays ^^p,.^^,^ 
The A.<,/dop.s thaliana microarray eioScienoes, 
distant from each other, on Type V silane coatea sua 
Linghamshire, UK,. The Cone set included ^ ^^''^^^^^^^^ 
unigen clone collection from Incyte (Aratodopsfe Gem 1, Incyte, USA). 

annotauon of the genes relating to the spotted ESTs, BLASTN ^^-'-[^^^^^^ 
was performed. To make the analysis easier a collection of genom,c sequences, bearing on^ 
Z gtl was buiK according to fl,e available annotations Each of those ^'^'^^"'^^ ^ 
, ZZ intergenlc sequence followed by the exon-intron structure of '^J^J^2 
dLnst^am intergenic sequence, Intergenic being the ^^^- ^l^^^^^^^^Z Z 
3ndstopcodonsfromneighbonng.ot.^^^^^^^^ 

hitc; wpre extracted and submitted to a BLAb t a searcn ayo k 
rrliledinfonnation concerning the poten«alfun.i. 

5 ware searched using PmDom. The complete data set can be found on he 

--^i^rire:^^^^^^^^^^ 

amplified using M13 primers, purified with Multiscreen Kot^ p 
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Belgium) and arrayed on the slides using a Molecular Dynamics Generation III printer 
(Amersham Biosciences). Slides were blocked in 3.5%SSC, 0.2%SDS, 1% BSA for 10 
minutes at 60°C. 
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RNA amplification and labeling 

Antisense RNA amplification was performed using a modified protocol of in vitro transcription 
as described earlier in Puskas et al. (2002). For the first strand cDNA synthesis. 5 yg of total 
RNA was mixed with 2 pg of an HPLC-purlfied anchored oligo-dT + T7 promoter (5'- 
GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGG-T24(A/C/G)-31 (SEQ ID NO 
2756) (Eurogentec, Belgium), 40 units of RNAseOUT (cat# 10777-019. Invitrogen. Merelbeke. 
Belgium) and 0.9M DW trehalose (cat# T-5251. Sigma Belgium) in a total volume of Upl. and 
heated to 75°G for 5 minutes. To this mixture. 4 pi 5x first strand buffer (Invitrogen. Belgium). 2 
I 0 1 M DTT, 1 pMO mM dNTP mix. 1 pl1.7 M D(+)trehalose (cat# T-5251. Sigma Belgium) 
and 1 pi. 200 Units of Superscript II (cat#: 18064-014. Invitrogen. Belgium) was added in 20 pi 
final volume. The sample was incubated in a Biometra-Unoll thermocycler at 37°C for 5 
minutes 45''C for 10 minutes. 10 cycles at 60°C for 2 minutes and at 55°C for 2 minutes. To 
the first strand reaction mix, 103.8 yl water, 33.4 pi 5x second strand synthesis buffer 
(Invitrogen. Belgium). 3.4 p\ 10 mM dNTP mix. 1 pi of 10U/pl E.coli DNA ligase (cat#: 18052- 
019. invitrogen. Belgium). 4 pi 10 U/pl E.coli DNA Polymerase I (cat#: 18010-025. Invitrogen. 
Belgium) and 1 pi 2U/pl E.coli RNAse H (cat#: 18021-071. Invitrogen. Belgium) was added, 
and incubated at 16-C for 2 hours. The synthesized double-stranded cDNA was purified with 
Qiaquick (cat#: 28106, Qiagen. Hilden. Gemiany). Antisense RNA synthesis was done by 
AmptiScribe T7 high yield transcription kit (cat#: AS2607: Epicentre Technologies. USA) in 
total volume of 20 pi according to the manufacturer's instructions. The RNA was purified with 
RNeasy purification kit (cat#: 74106; Qiagen. Gemiany). From this aRNA. 5 pg was labeled by 
reverse transcription using random nonamer primers (Genset. Paris. France), 0.1 mM 
d(G/T/A)TPs. 0.05 mM dCTP (Amersham Biosciences, UK), 0.05 mM Cy3-dCTP or Gy5-dCTP 
(caW PA53023, PA55023; Amersham Biosciences, UK) 1x first strand buffer, 10 mM DTT and 
200 Units of Superscript II (cat#: 18064-014. Invitrogen, Belgium) in 20 pi total volume. The 
RNA and primers were denatured at 75'C for 5 minutes and cooled on ice before adding the 
remaining reaction components. After 2 hours incubation at 42X. mRNA was hydrolyzed in 
250 mM NaOH for 15 minufes at 37-C. The sample was neutralized with 10 pi of 2 M MOPS 
and purified with Qiaquick (cat#: 28106. Qiagen. Germany). 

Array hybridization and post-hybridization processes 

The probes were resuspended in 30 pi hybridization solution (50 % fomnamide. 5x SSC. 0.1 % 
SDS. 100 mg/ml salmon spemi DNA) and prehybridized with 1pl poly-dT (Img/ml) at 42X for 
30 minutes to block hybridization on the polyAn" tails of the cDNA on the arrays. 1 mg/ml 
mouse COT DNA (ca* 18440-016. Invitrogen. Belgium) was added to the mixture and placed 
on the array under a glass coverslip. Slides were incubated for 18 hours at 42-C in a humid 
hybridization cabinet (cat#. RPK0176-. Amersham Biosciences. UK). Post-hybridizat.on 
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washing was performed for 10 minutes at 56»C in IxSSC. O.m SDS. two SmiTl^^^^^^ 
at 56°C in O.lxSSC. Q.m SDS and for 2 minutes at 37''C in O.lxSSC. 

Scanning and data analysis 

Arrays were scanned at 532 nm and 635 nm using a Generation ill scanner (Amersham 
Biosciences, UK). Image analysis was performed with ArrayVision (Imaging Research Inc. 
Ontario. Canada). Spot intensities were measured as artifact removed total intensities (ARVol). 
No background correction was performed. First, within-slide normalization was addressed by 
plotting^for each single slide a "MA-plot" (Yang et al.. 2002), where M = log^ (R/G) and A = log^ 
0.5VRXG. The "LOWESS" normalization was applied to correct, for dye-intensity differences. 
Subsequently, in order to normalize between slides and to identify differentially expressed 
genes between the two genotypes, two sequential analyses of variance (ANOVAs) were 
applied, proposed by Wolfinger et al. (2002), as follows: 1) firstly, the base-2 logarithm of the 
LOWESS".transformed measurements for ail 73.264 spots (y,^) was subjected to a 
normalization model of the fomi y,«„ = ^ + AAR. s,,,„. where Is the sample mean, A. 
.s the effect of the /cth an-ay (k = 1-4). A.D,R„ is the channel-effect (AD) for the mth replication 
of the total collection of cDNA fragments (m = 2; left or right), and s,,. is the stochastic error 2) 
secondly, the residuals from this model were subjected to 4,579 gene-specific models of the 
fomi r,j,, - n + GA. GA + GPj -f y,, where GA. is the spot effect, GA is the gene-specific 
dye effect, GPj is representing the signal intensity for genes that can specifically be attributed 
to the genotypes (effect of interest), and y,j, is the stochastic error. All effects were assumed to 
be fixed effects, except e,,^ and y^,, A t-test for differences between the Gpj effects was 
performed, where the t-tests are all based on n.+n,.2 degrees of freedom corresponding to the 
n. WT hybridisations and n^ E2Fa-DPa hybridisations. The p-value cutoff was set at 0.01. No 
further adjustment for multiple testing was performed, as Bonferroni adjustment for 4.579 tests, 
to assure an experiment-wise false positive rate of 0.05. results in a p-value cutoff of le"^"' 
which is certainly too consen/ative; therefore it was chosen to set the p-value cutoff arbitrarily 
at the 0.01 level. Also GA effects were estimated and t-tested for significance at the m level 
.n a same way as described above. Genes with a significant GA effect were discarded 
Genstat was used to perfomi both the normalization and gene model fits. 

Example 3: Results of the Mirroan-av analysis, .n d statistinn, .n.iy.!. 
A micro-array containing in duplicate 4579 unique Arabidopsis ESTs, representing about a 
sixth of the total genome, was used to compare the transcriptome of wild type with that of 
E2Fa-DPa overexpressing plants. cDNA was synthesized from total RNA isolated from wild 
type and transgenic plants harvested 8 days after sowing. At this stage, transgenic plants were 
distinguished from control plants by the appearance of curled cotelydons which display ectopic 
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ceirdUTorand enhanced endoredupllcation (De Veylder et al.. 2002). In the first two 
hybridizations Cy3 and Cy5 fluorescently labeled probe pairs of control and E2Fa-DPa cDNA 
were used using independent mRNA extractions of the E2Fa.DPa plants. Subsequently, a 
dye-swap replication was performed for both hybridizations, resulting in a total of four cDNA 
5 microan^ay hybridizations. 

Fluorescence levels were analyzed with the aim to establish whether the level of expression of 
each gene varied according to overexpression of the E2Fa-DPa transcription factor. Two 
sequential analyses of variance (ANOVAs) were used, as proposed by Wolfinger et al. (2002). 

10 The first ANOVA model, called the "normalization" model, accounts for expenment-w.se 
systematic effects, such as array- and channel-effects, that could bias inferences made on the 
data from the individual genes. The residuals from this model represent normalized values and 
are the input data for the second ANOVA model, called the "gene" model. The gene models 
are fit separately to the normalized data from each gene. This procedure uses differences in 

15 nomialized expression levels, rather than ratios, as the unit of analysis of expression 
differences. 

Prior to the estimation of genotype-specific signal intensities of the genes (G^ effects), which 
are the effects of interest, gene-specific dye effects (G,D, effects) were estimated and t-tested 
20 for significance at the Wo level. One hundred and thirty one genes showed a significant GA 
effect and were discard from further analysis. For each of the remaining 4.448 genes on the 
arrays a t-test on the GA effects for significant differences (p<0.05) was perfomied. Figure 1 
plots the obtained p-values (as the negative log1 0 of the p-value) against the magnitude of the 
effect (log2 of estimated fold change). This volcano plot illustrates the substantial difference 
25 significance testing can make versus cutoffs made strictly on the basis of the fold change. The 
two vertical reference lines indicate a 2-fold cutoff for either repression or induction, while the 
horizontal reference line refers to the p-value cutoff at the 0.05 value. These reference lines 
divide the plot into six sectors. The 3.535 genes in the lower middle sector have low 
significance and low fold change, and both methods agree that the con-esponding changes are 
30 not significant. The 188 genes in the upper left and right sectors have high significance 
(p<0 05) and high fold change (>2); 84 of these genes show a significant two-or-more-fold 
induction of expression, where the remaining 104 genes show a significant two-or-more-fold 
repression of expression in the E2Fa-DPa plant. Finally, the 715 genes in the upper middle 
- sector represent significant (p<0.05) up- or downregulated genes, but with a low (<2) fold 
35 change. The full dataset of genes can be viewed at httn //www.psb.ruq.ac.be /E2F, which 
dataset is incorporated herein by reference. 
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sequences that are 1.3 times upregulated (ratio of more than 1.999) in E2Fa-DPa 

overexpressing plants are presented in Table 4. All the sequences that are 1 3 times 
repressed (calculated as 1/ ratio of less than 0.775) are presented in Table 5. Particularly 
.nterestlng genes that are more than 2.fold upregulated or 2 fold repressed are selected and 
separately represented in Tables 1 and 2. 

Bxamole 4: an/./, /,..-. 

The identity of me genes was confirmed by sequencins, and the induction of a random set of 
genes was oonflmied by RT-PCR analysis (Figure 5) 

RNA was isolated from plants 8 days aner sowing with the Trizo, reasen, (Amersham 
Biosaenoes). Flrst^trand cDNA was prepared fram 3 pg of total RNA with the Superscript RT 
II tat (invitrogen) and oligo(dT,18 according to the manufacturer's instrirctions. A 0.26 pi aliquot 
Of the total RT reaction volume (20 pi, was used as a template in a semiK,uan«ative RT- 
medrated PCR ampimcation. ensuring that the amount of amplified product remained in linear 
proportion to the initial template present in the reaction. From the PCR reaction, 10 pi was 
separated on a 0.8% agarose gel and transfened onto Hybond membrane, (Amersham 
Broscences) that were hybridized at eS'C with fluorescein-labeled pmbes (Gene Images 

ZT2 ^"'"'"'^ ^'^^^l hybridized bands were detected with the 

CDP Star detection module (Amersham Biosciences). Primers used were 

5'-AAAAAGCAGGCTGTGTCGTACGATCTrCTCCCGG-3' (SEQ ID NO 2757) and 5'-AGAAA 

^a™^ '° "° ^^=^> ^ 5'.ATAGTCGACTCAGCGA 

CTcIg ^r""' '° 5..CAGATCTTGTTAACCTTGACAT 

CTCAG-3 (SEQ ID NO 2761) and 5'-GGGTCAAAAGATACAACCACACCAG-3' (SEQ ID NO 
2762) for glutamlne synthetase (GS). 6'^GTTTACGAGCTACATGGGCC-3' (SEQ ID NO 

^r!.?:! <^^° '° ^° 5.GAACGCCA 

ZZl^T"^'"' '° ''''' ^'^'""^ ^^^'^'-^ "'^^ 5'-GTTACCGG 

TsEQ ^D^™ r ''''' ^■-<=«TCGGAGGGAMGTCTGACG-3. 

SEQ D NO 2768) for LOB domain protein 41. 5'-GTGTGGTrTCCAAGCmCCTACG-3' 
(SEQ ID NO 27S9) and S'-GGTGAAGGGACTAGCCTTGTGG-S' (SEQ ID NO 2770) for 
.socitrate lyase, 5'-GGGATCAATCCTCAGGAGAAGG-3' (SEQ ID NO 2771) and 5' 
^^J^^'l'^-^^^'^'^^'^'^^y^^' (SEQ ID NO 2772) for nitrite reductase (NIR), and 5'- 
TTACCGAGGCTCCTCTTAACCC-3. (SEQ ,D NO 2773) and a'-ACCACCGATCCAGACA 
CTGTAC-3' (SEQ ID NO 2774) for acUn 2 (ACT2). 
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re^ qulated cellular processes 

The genes of the present invention identified from the microarray experiment of Example 2 
have unique identification numbers (MIPS accession number e.g. At1g57680). The MIPS 

5 accession number Is v^idely accepted in this field as it directly refers to the genomic sequence 
and the location of the sequence in the Arabidopsis thaliana genome. Accession numbers are 
allocated by the Munich Information Center for Protein Sequences (MIPS) and are stored .n 
the MIPS Arabidopsis database. Publicly available sequence and annotation data from all 
other AGl ^Arabidopsis Genome Initiative") groups are included to establish a plant genome 

10 database (Schoof H. et al. (2002)). The MIPS Arabidopsis database can be accessed via the 
internet KHp-//,.ip. n.f H./cni-bin/proi/thal and the database can be searched v^ith the protein 
entry code (eg. At1g57680). An example of the type of sequence infomnation and protein 
domain information that is provided for a certain sequence In the MIPS database Is shown 
Figure 4. 

15 ^ ^ 

An additional blast search with the genes according to the present invention was performed on 

public databases containing sequences from other plant species and other organisms. For 

some of the genes identified by the microan^y. significant levels of homology (low E-values) 

were found with sequences from other organisms (see Tables 1 and 2 with reference to the.r 

20 Genbank accession number). So far. mainly com and rice homologues were identified, but as 

more genomes will be sequenced in the future, many more homologues will be identifiable by 

the person skilled in the art as useful in the methods of the present invention. 

DNA replication artd cell cycle genes 
95 Genes up or downregulated in the EZFa-DPa overexpressing plants can be classified into 
clear groups according their function (Tables 1 and 2). 14 Genes that are 2-foJd or more 
upregulated belong to the class of genes involved in DNA replication and modification, 
correlating with the obsen/ation that E2Fa-DPa overexpression plants undergo extensive 
endoreduplicatlon. Most of these genes have previously be shown to be upregulated by E2F- 
30 DP overexpression in mammalian systems including a putative thymidine kinase, replication 
factor c. and histone genes (4 different ones). Other E2Fa-DPa induced S phase genes 
include a linker histone protein, the topoisomerase 6 subunit A and two subunits of the h.stone 
acetyltransferase HAT B complex, being HAT B and Msi3. The HAT B complex is responsible 
- - for the specific diacethylation of newly synthesized histone H4 during nuclease assembly on 
35 newly synthesized DNA (Lusser et al.. 1999). Also a DNA methyltransferase responsible for 
the methylation of cytosine in cells that progress though S phase was identified among 
upregulated genes. 
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Besides the overexpressed E2Fa gene (being 90-foJd more abundant in the E2FaPa 
overexpressing plants, compared to control plants), only one cell cycle gene {CDKB1;1) shows 
a 2-fold or more change in expression level upon E2Fa-DPa overexpresslon. CDKB1;1 was 
previously predicted to be a candidate E2F-DP target by virtue of a consensus E2F-DP-blnding 
site in Its promoter (de Jageret al., 2001). Whereas CDKB1;1 activity is maximum at the G2/M 
transition, its transcript levels start to rise during late S-phase (Porceddu et al., 2001; Menges 
and Murray, 2002). Upregulation of CDKB1;1 might therefore be a mechanism to link DMA 
replication with mitosis. 

Cell wall biogenesis genes 

Four members of the xyloglucan endotransglucosylase (XET) gene family were found to be 2- 
fold or more upregulated in E2Fa-DPa overexpressing plants, one of them identical to the 
Meri-5 gene (Medford et a!., 1991). XETs are enzymes that modify cell wall components and 
therefore play a likely role in altering size, shape and physical properties of plant cells. 
Reversal breakage of the xyloglucan tethers by XETs has been proposed to be a mechanism 
for allowing cell wall loosening in turgor-drlven cell expansion (Campbell and Braam, 1999). 
However, there are several reasons to believe that E2Fa-DPa induced XETs are not required 
for cell expansion. First, cells divide more frequently in E2Fa-DPa overexpressing plants, but 
the overall cell size of the cells is smaller. Therefore, no overall increase in expansion-rates Is 
needed. Second, correlated with the absence of increased cell expansion in the transgenic 
lines, no induction of genes with a known role in this process, such as expansins, can be seen. 
Therefore, the hydrolytic activity of the XETs might be required to incorporate the newly 
synthesized cell walls formed during cytokinesis Into the existing cell wall structure. 
Alternatively, as XET activity has shown to be involved In the postgerminatlve mobilization of 
xyloglucan storage reverses in Nasturtium cotelydons (Farkas et al., 1992; Fanutti et al., 
1993), induction of XETs in E2Fa-DPa overexpressing plants might relate to polysaccharide 
breakdown to serve the metabolic and energy needs which are required to synthesize new 
nucleotides (see below). 

interestingly, two XETs were identified in the set of 2-fold or more downregulated genes. 
These XETs are more related to each other than to the induced XET proteins. This differential 
response of XETs towards the E2Fa/DPa induced phenotypes suggests that plant XETs can 
be classified in at least 2 different functional classes. 

Genes involved in metabolism and biogenesis 
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Both the group of up and down regulated genes contains a relative large group of genes 
involved in metabolism and biogenesis. Most remarkable is the induction of genes involved in 
nitrogen assimilation, such as nitrate reductase (NIA2) (see Figure 2). glutamine synthetase 
(GS), and glutamate synthetase (GOGAT). Although not present on the microan-ay. the nitrite 
reductase (NiR) gene was found to be induced in the transgenic line, as demonstrated by RT- 
mediated PGR analysis. Nitrogen and nitrite reductase catalyse the first step in the nitrogen 
assimilation pathway, whereas glutamine and glutamate synthetase are involved in both the 
primary assimilation from nitrogen as reassimilation of free ammonium, supplying all plants 
nitrogen needed for the biosynthesis of amino-acids and other nitrogen-containing compounds. 

There are other indications that nitrogen metabolism is altered in E2Fa-DPa overexpressing 
plants, such as the modification of genes reported to be involved in Medicago induced 
nodulation (MTN3 and a nodulin-like gene); and the downregulation of genes involved in sulfur 
assimilation (adenylylsulfate reductase (APR; 2 different genes) and a putative adenine 
phosphosulfate kinase). Genes involved in sulfur assimilation such as APR have previously 
been shown to be transcriptionally doWnregulated during nitrogen deficiency (Koprivova et al.. 
2000). 

Upregulation of nitrogen assimilation genes in E2Fa-DPa overexpressing plants might reflect 
the need for nitrogen for nucleotide biosynthesis, as purine and pyrimidine bases are nitrogen- 
rich. If nitrogen assimilation was indeed stimulated by E2Fa/DPa overexpression. two 
requirements should be fulfilled. Since nitrogen assimilation through the GS/GOGAT pathway 
requires a-ketogluterate (Lancien et al.. 2000). a first requirement is that there should be 
enough a-ketogluterate to act as acceptor molecule for ammonium. Secondly, because 
assimilation of nitrogen is energy consuming, the rate of reductant production should be higher 
in an E2Fa/DPa transgenic than in wild-type plants. 

Our micro-array data suggest that in the E2Fa-DPa overexpressing plants, a-ketogluterate 
accumulation is stimulated in different ways. First, a-ketogluterate production is improved by 
increased photosynthetic activity, as indicated by the 4.7-fold upregulation of large subunit of 
ribulose-1.5-bisphosphate carboxylase/oxygenase (Figure 2). This results in an accumulation 
of glyceraldehyde-3-phosphate. Glyceraldehyde-3-phosphate can be converted into fructuse- 
1,6-bisphosphate by fructose bisphosphate aldolase. However, a 6-fold downregulation of the 
- fructose bisphosphate aldolase gene rather suggests the conversion of glyceraldehyde-3- 
phosphate into pyruvate, which can be converted into a-ketogluterate during glycolysis in the 
citrate cycle. The preferential conversion of glyceraldehyde-3-phosphate into pyruvate in 
favour of sugars fits the higher need for amino-acids than for sugars for nucleotide 
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biosynthesis. A shortage for rlbose-5-phosphate for nucleotide synthesis is also evident from a 
downregulation of sucrose-phosphate synthase, resulting in decreased conversion of fructose- 
6-phosphate and glucose-6-phosphate into sucrose (Figure 2). 

A second source of a-ketogluterate Is provided in the glyoxylate cycle by the 3.1 fold increase 
in expression of isocitrate lyase, suggesting an increased lipid turnover in E2Fa-DPa 
overexpressing plants. Isocistrate lyase activity cleaves isocitrate into glyoxylate and succinate 
(Figure 2). Whereas the formed glyoxylate can be converted into glycine, which is also 
required for nucleotide biosynthesis, succinate can be converted into a-ketogluterate in the 
citrate cycle. A 2.3-fold decrease of the fumarase gene presumably stimulates the conversion 
of produced a-ketogluterate into glutamate by causing an accumulation of succinate and 
fumarate, which is also a side product formed during nucleotide biosynthesis (Figure 2). 

Assimilation of nitrogen Is energy consuming. When rates of nitrate reduction are high, this 
pathway becomes the major sink for reductant. About 10% of the electron flux in 
photosynthesizing leaves is used for nitrate reduction. The amount of required reductant, 
which in leaves originates from electronic photosynthetic electron transport, is therefore 
expected to be higher in the E2Fa-DPa transgenics. Con-espondingly, several components of 
the chloroplast electron transport chain and associated ATP-synthesing apparatus, such as 
cytochrome B6, a RSI I subunit and the ATPase epsilon subunit are upregulated in the 
transgenic plants. Increased expression of the protochlorophyllide reductase precursor 
suggests that an increase in chlorophyll biosynthesis is stimulated in E2Fa-DPa 
overexpressing plants. 

Famine of nitrogen has a putative impact on amino-acid biosynthesis, as three different amino- 
acid aminotransferases, are downregulated in E2Fa-DPa overexpressing plants. Accompanied 
with a putative decreased aminotransferase activity is the observed reduction in expression of 
an enzyme Involved in pyridoxine biosynthesis. Shortage of nitrogen-rich amino-acids is also 
evident from reduced expression of genes encoding vegetative storage proteins (VSPIand 
VSP2); and ERD10, a protein with a compositional bias towards glu (Kiyosue et al., 1994). 
Additional evidence for amino acid shortage comes from downregulation of a myrosinase- 
binding protein and cytochrome P450 monooxygenase CYP83A1. Both proteins are involved 
in the biosynthesis of glucoslnolates, being nitrogen and sulfur containing products derived 
from amino-acids (Wittstock and Halkier, 2002). 



Transcription factors and signal transduction 
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A total Of 4 transcription factors were identified among the genes being 2-fold or more 
upregulated, including two homeobox domain transcription factors. Among them the 
anthocyaninless2 (ANL2) gene was identified; which is involved in anthocyanin accumulation 
in subepidermal leaf cells (Kubo et al., 1999). The lack of an obvious increase in anthocyanin 
accumulation in E2Fa-DPa overexpressing plants suggests a role forthe ANL2 protein in plant 
development different from anthocyanin production. This hypothesis is substantiated by the 
observation that anl2 mutant plants contain extra ceils in the root between the cortical and 
epidemnal layers (Kubo et al., 1999). 

The second upregulated homeobox domain transcription factor is Atbh-6. Expression of Atbh-6 
is restricted to regions of cell division and/or differentiation and has been shown to be inducible 
by water stress and ABA (Soderman et al.. 1999). Other putative ABA sensitive genes can be 
recognized among the E2Fa-DPa induced clones, as well as the cold regulated protein 
COR6.6. a seed imbitition-like protein and a dormancy-associated protein. Here again, 
changes in expression level of these genes might be con^elated with modifications In carbon 
metabolism. A link between ABA and sugar signaling is evident from the identification of 
several loci involved in both sugar and hom)onal responses (Finkelstein and Gibson. 2002). 
Alternatively, it might be the occurrence of enhanced endoreduplication and/or cell division 
itself that causes a change in the osmotic potential. 

Among the downregulated transcription factors a DOF family member is present. Many DOF 
transcription factors are participating in the regulation of storage protein genes and genes 
involved in carbon metabolism (Gualberti et al.. 2002). Its downregulation might therefore be 
linked with the shortage of amino-acids due to the high demand of nitrogen for nucleotide 
biosynthesis. 

Other regulatory genes modified in E2Fa-DPa overexpressing plants include protein kinases, 
several putative receptor kinases, a putative phytochrome A. and WD-40 repeat containing 
proteins (Tables 1 and 2). Interestingly, a SNF1-like kinase-is downregulated 2-fold In E2Fa- 
DPa overexpressing plants. In addition to its proposed role in sugar signaling, the SNF1 kinase 
also negatively regulates the activity of plant nitrate reductase (Smeekens, 2000). 

Example 6: Endoreduplication levels of F ^F^-DPa Plants are nitrogen dependent 
The modified expression of a large number of metabolic and regulatory genes, directly or 
indirectly linked to nitrogen metabolism, suggests a direct relationship between the high 
endoreduplication levels found in the E2Fa/DPa transgenic plants and nitrogen availability. To 
test this hypothesis, wild type and transgenic plants were grown on medium containing 
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different levels of ammoniumnitrate, ranging from 0.1 to 50 nnM. Eight days after germination 
ploidy levels in these plants was determined by flow cytometry. Increasing ammoniumnitrate 
levels hardly had an effect on the ploidy distribution pattern in wild type plants (Figure 3A). In 
contrast, in the E2Fa-DPa transgenic plants increasing ammoniumnitrate levels resulted in a 
5 reproducible 'and significant increase in the amount of 32C and 64C nuclei (Figure 3B). 
Comparing the lowest with the highest concentration of ammonimumnitrate, an increase of 
32C from 2.0 (± 0.3) % to 6.5 (+ 1.5) %, and of 64C from 0.7 (±0.3) % to 2 (± 0.5) % can be 
seen. Increasing ammonium levels did not have any effect on the plant phenotype, as plants 
remained small with curled leaves on all concentrations of nitrogen tested. These data indicate 
10 that the endoreduplication levels in the E2Fa-DPa overexpressing plants are limited by 
nitrogen availability, and that an excess of nitrogen is incorporated into new DNA than in other 
nitrogen containing compounds. 

Example 7: Promoter analysis ofE2Fa-DPa regulated genes 

1 5 Promoter analysis 

The intergenic sequence corresponding to the promoter area of each gene spotted on the 
microarray was extracted from genomic sequences. These genomic sequences are easily 
accessible for example from the MIPS MatDB database (http://mips.gsf.de/proj/thal/db/). From 
those intergenic sequences, up to 500bp upstream of the ATG start codon were extracted and 

20 subjected to motif searches in order to retrieve potential E2F elements. Both position and 
frequency of occurrence was determined using the publicly available execuTable of 
Matlnspector (version 2.2) using matrices extracted from PlantCARE and matrices made 
especially for this particular analysis (Lescot et al., 2002). The relevance of each motif was 
evaluated against a background consisting of all the sequences from the dataset. 

25 

Results 

To distinguish In the present data set the putative direct target genes of E2Fa-DPa from the 
secondary induced genes, the first 500 bp upstream of the ATG start codon of the genes with 
2-fold or more change in expression was scanned for the presence of a E2F-like binding site 

30 matching the sequence (A/T)TT(G/C)(G/C)C(G/C)(G/C) (SEQ ID NO 2775). Of all the different 
pennutations possible, only the TTTCCCGC (SEQ ID NO 2776) element was statistically 
enriched in the set of E2Fa-DPa upregulated genes, suggesting it is the preferred binding site 
of the E2Fa-DPa complex (Table 3). Moreover, target genes containing this element belong 
■ mainly to the group of genes involved in DNA replication and modification, being the main 

35 group of target genes in mammalian systems. These data illustrate that the TTTCCCGC 
sequence is the most likely cis element recognized by E2Fa-DPa. The observation that not all 
genes having this DNA sequence in their promoter suggests that the presence of the 
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TTTCCCCGC motif is not sufliclert to make a gene responsive towards E2Fa-DPa, and that 
E2Fa-DPa oo-operates with otherfactors to acUvale transcription. 

„ is not excluded that genes wShout an E2F-like.binding site are not directly activated by 
E2Fa-DPa. Chromatin immunopredpitation experiments have shown that mammahan KF 
tsctors can bind to promoters w«hou. a clear E2F recognition motif (Kiyosue e^ a/., 99^, 
suggesting that E2FDP might reocgntee non-canonical binding sites, or might be reon.,ted i^y 
promoters through the association of Cher facto., in this respect, the C*^«a v*ans n,^e 
reductase gene, of which the Ara«dops/s homCogue was shown herein to be '"^"""J 
DPa, binds an E2F-DP complex, although a clear consensus binding site ,s lacking (Cannons 

and Shiflett, 2001). 

£2F can activate as well as repress promoter activity. 

in the Nlcotiana benthamlana PCNA promoter a E2F sequence was identified actrng as a 
negative regulatory element dunng development (Egelkrout et al., 2001). Also the tobacco 
nbonucleotide reductase small subunit gene contains a E2F element wori<ing as a repmssor 
outside the S-phase (Chaboute et al., 2000). in the set of downregulated genes no particular 
enHchment of a speCc E2F sequence could be seen (Table 3). Therefore the inven o. 
believe that the E2Fa.DPa complex mainly worics as a transcriptional activator, and that other 
E2F-DP complexes are involved in E2F-mediated transcriptional repression. 

.^^M. «• .nrfrv/d r - -^.^^.^-^-xnn of .^nme r,Pne.s /denfrfieri h,. the method of tCe 

nrti sent invention . . 

The genes characten^ed hereunder, are particulariy useful for making plants wrth improved 
growm characterisacs. These preferied genes are introduced Into a plant and upreguiated or 
downregulated in order to simulate E2Fa/DPa effects and/or to after one or more 
diaracteristics of a plant. The particular growth oharaoteristlc that may be influenced by these 
genes, is described in the following paragraphs by reference to the biological function of that 
particular gene. 

At1g07000 showing iiomology to leucine zipper 

At1g07000 is a potential leucine zipper that is not preceded by a basic domain. The leucine 
zipper consists of repeated leucine residues at every seventh position and mediates protein 
. dimerization as a prerequisite for DNA-b.nding. The leucines are directed — ^ ^'^^ 
an alpha-helix. The leucine side chains of two polypeptides are mought to ;n«-*fl»-^ 
dlmerfcation (knobs-into-holes model). The leucine zipper may dictate dimenzation specificrty. 
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ne 2,ppers are DNA binding protein with dimerizatlon properties, having important 
functions in development and stress tolerance in plants. 

Atl 909070 showing homology to Soybean Cold Regulate gene SRC2 
This genes arid its expressed protein is predicted in Arabidopsis, rice, com, soybean, however 
based on a homology search using the BLAST program, no functional homologue was known' 
not even a dear animal homologue. so no clear function can be predicted for this gene or 
protein (Takahashi.R. and Shimosaka.E. (1997)). 

At1g21 690 showing homology to Replication factor 
Replication factor C (RFC) is a pentameric complex of five distinct subunits that functions as a 
clamp loader, facilitating the loading of proliferating cell nuclear antigen (PCNA) onto DNA 
dunng replication and repair. More recently the large subunit of RFC. RFC (p140) has been 
found to interact v.lth the retinoblastoma (Rb) tumor suppressor and the CCAAT/enhancer- 
binding protein alpha (C/EBPalpha) transcription factor. It is reported that RFC (p140) 
associates with histone deacetylase activity and interacts with histone deacetylase 1 (HDAC1) 
(Anderson, L. A. and Perkins. N. D. (2002); Furukawa.T. et al. (2001)) RFC is poorly known in 
plants. It can be important for development for modulating gene expression during cell cycle at 
s phase, or through chromatin regulatton, 

At1g23030 showing homology to Armatllllo protein 

Members of the armadillo (anm) repeat family of pmteins are implicated in tumorigenesis 
ennbryonio development and maintenance of tissue integrity. ARM proteins participate in 
hnlang oytoskeleton to membrane proteins and structures. These proteins share a central 
domain that is composed of a series of imperfect 45 amino acid repeats. Amiadillo family 
members reveal diverse ce.llular locations reflecting their diverse functions. A single protein 
exerts several functions through interactions of its armadillo repeat domain with diverse 
bindrng partners. The proteins combine structural roles as cell-contact and oytoskeleton. 
assocated proteins and signaling functions by generating and transducing signals affecting 
gene expression. The study of am,adillo family members has made it increasingly dear that a 
d,s^nct,on between structural proteins on the one hand and signaling molecules on the other is 
rather artificial. Instead amiadillo family member exert both functions by interacHng with a 
number of distinct cellular-binding partners. Proteins of the amradillo family are Involved in 
diverse cellular processes in higher eukaryotes. Some of them, like annadillo, beta^catenin and 
Plakoglobins have dual funcfions In Inter^llular Juncaons and signalling cascades. 
belonging to the importin-alpha-subfamily are Involved in NLS {Nuclear localization signal) 
recognition and nuclear transport, while some members of the amradillo family have as yet 
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unknown functions. (Wang. Y. X. ai. ^ } development. Some are specific 

^t::::z:::z^^^^ - - ^-^^ - 

;,«g27500 s/.ov„ng ftomo W to Kfn.s/n Ug^M ^ ^ approximate^ 
.0 KOa ana /^^^ 

dependent on the presence of ATP and miao ^^^^^^ 

Jding to microtubules (MTs, when no. '-^^ ^^^^ „e tai, and motor 

the flux of vesldes between the dffferent cell membrane compartments. 

Plant receptor-like kinases (RLKs) are transm ^ 3trtKi„g 

exUBcellular domains and carboxyHermina, "^^^^^^ ^^^^^ as ep.dem,a, 

, resemblance in domain o^anization to tl,e f ^ ^„.,„, „„e than 600 

gmwth factor receptor. The recently sequenced ^--^^""^ ^ still 

RUK homo,ogs.Mhough only a handful Of these genes ^av^^^^^^^ ^^^^^^ 

have identified iigands or downstream targets ^^^^^ 
CU.VATA1, Brassinosteroid J'"-*;— ,y — of this large gene 

,5 provide much-needed infom,atlon °" Jeiopment, disease resistance, 

family. RUKs control a 7 J^J^^ ^^.^^^^^^^^^^ the expression studies and 

TTrR gene has been described by KrocynsKa, B. et al. (1999). 
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At2g30590 showing homology to WRKY transcription factor (Toll/interleu/<in-1 receptor- 
like protein) 

The sequence shows homology to tomato Cf-9 resistance gene Avr9/Cf-9 rapidly elicited 
protein 4 (NL27) (Hehl, R. et al. (1998)). WRKY proteins are a large group of transcription 
factors restricted to the plant kingdom. WRKY proteins are a recently identified class of DNA- 
binding proteins that recognize the TTGAC(C/T) W-box elements found in the promoters of a 
large number of plant defense-related genes (Dong and Chen, 2003). It has been found that 
the majority are responsive both to pathogen Infection and to salicylic acid. The functions of all 
other VVRKY genes revealed to date involve responses to pathogen attacl^, mechanical stress, 
and senescence (Dong and Chen, 2003). 



At1g80530 showing homology to Nodulin 

Infection of soybean roots by nitrogen-fixing Bradyrhizobium japonicum leads to expression of 
plant nodule-specific genes known as nodulins. Nodulin 26, a member of the major intrinsic 
protein/aquaporin (AQP) channel family, is a major component of the soybean symbiosome 
membrane (SM) that encloses the rhizobium bacteroid. These results Indicate that nodulin 26 
is a multifunctional AQP that confers water and glycerol transport to the SM, and likely plays a 
role in osmoregulation during legume/rhlzobia symbioses (Dean et al. (1999). Rice (Oryza 
sativa van Nipponbare) possesses two different homologues of the soybean eariy nodulin 
gene GmENOD93 (GmN93), OsENOD93a (homology of 58.2% to GmENOD93), OsENOD93b 
(homology of 42.3%). In intact rice tissues, OsENOD93b was most abundantly expressed in 
roots and at much lower levels in etiolated and green leaves, whereas the expression of 
OsENOD93a was very low in roots and etiolated leaves, and was not detected in green leaves. 
The level of OsENOD93a expression was enhanced markedly in suspension-cultured cells, 
whereas that of OsENOD93b did not increase (Reddy et al. (1998)). Homologues of genes that 
are produced in response to infection of soybean roots by bacteria are also present in other 
plants such rice. Their function is largely unknown, some functional homologues are identified 
such as a water channel involved in osmoregulation. 

At2g34770 showing homology to Fatty acid hydroxylase 

This gene has been described in Matsuda et ai. (2001). A common feature of the membrane 
lipids of higher plants is a large content of polyunsaturated fatty acids, which typically consist 
of dienoic and trienoic fatty acids. Two types of omega-3 fatty acid desaturase, which are 
present in the plastids and in the endoplasmic reticulum (ER), respectively, are responsible for 
the conversion of dienoic to trienoic fatty acids. To establish a system for investigating the 
tissue-specific, and hormone-regulated expression of the ER-type desaturase gene (FADS), 
transgenic plants of Arabidopsis thaliana (L.) Heynh. containing the firefly luciferase gene 
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/1 1 \C\ fused to the FADS promoter (FAD3..LUC) were consu uoi 

(LUC) tusea lo me r h desaturase is regulated through synergistic 

this report suggest that the expression of ER-type desaturase is a 

tnis repori buyy hormonal regulation and the tissue 

and antaaonistic hormonal interactions, and that sucn normoiw 

and antagonisiic non „„dif,ed in accordance with the growth 

specfficity of the expression of this gene are further modified in ac 
pHase in plant development (We.lesen K, et al. (2001): Kachroo P. at al. (2001). Kahn. R. A. 
al. (2001); Smith, M. et al. (2000)). 

jinaa402 showing homology to CinnamoylCoA reductase 

At2g43iOisnomna «> transcript is expressed in lignified organs, 

rrR enzvme is involved in lignifioation. The CCR transcnpi a o y 

OCR eozyme is in ,^,„^^ mainlv in voung differentiaOng xylem. Also, 

i.e. root and stem tissues, and is localized mainly in young ^^a,vses the 

^enolignols may be precursors of end products other tt^an gnins. ^^^^ 
conveln o. cinnamoyl-CoAs into their corresponding cinnamaldehydes, ^-^^J^^^^ 
the phenylp^panoid pathway specifically dedicated to the monoiignol biosynthet. bran^^-^e 
two genes are d«eren«a.ly exp^ssed duhng development and in response to iMe^ion. 
WCCR1 is preferentially expressed in «ssues undergoing ligrtlflcation. In contrast, AtCCR2, 
ooTy expressed du.ng development. Is strongly ar,d — induced ^^^^^^^^^^^ 
.compatL interaction ..h Xanthomonas campestris P>; J— ^ 

hypersensitive ^^^^^^^^^ whose 

constitutive lignification whereas AtCCR2 is invoiveo in mo y . . , H..rmr, 

consiiiuuve uyi nciMW This oroten is involved dunng 

accumulation may lead to resistance (Lauvergeat et al. (2001)). This pro 
development, increase in growth diameter, lignification of vascular strands and interfascicular 



fibers. 



At2Q47440 showing homology to Tetratricopeptide repeat protein 

functions, me ir ^^.^ sequence). It 

Different proteins may contain from 3 to 16 tandem 1 f-K mo« 

has been shown that some proteins contain a TPR repeat er^ cell cycle regulated. 

30 M3g237m showing homology to Receptor like kinase TMK . , ,„i„o add 

=7 . . J ,11 ^ 19 .nhdomains and invanant amino acia 

The kinase domain of NtTMKI contained all of 12 subdomain 

residues found in euKaryotic protein kinases. The extracellular doma, conta^ed 1^ louche 
rich repeats which have been implicated in protein-protein interactions. The amino acrf 
uln^e Of ..TMK1 exhibited high homology with tho. . 
35 rice in both kinase and extracellular domains, suggesting that NtTMKI is a TOK horn g 

..accc. The .TMKI transcripts were prese. . . I 
different developmental stages in anthers and floral organs, ini.. 
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leaves was stimulated by CaCE, methyl jasmonate, wounding, fungal'SrcSnfand 
*ltosan. The NtTMKI mRNA level also incrBased upon infeo^on with tobacco mosaic 'virus 
(Cho and Pal (2000)). This pnjtein Is involved In different aspects of development and disease 

resistanria 



At3g61460 showing homology to RING H2 

RING-finger proteins contain cysteine-rich, zincbinding domains and are involved in the 
formation of macromolecular scaffolds important for transcriptional repression and 
ubiquitinatlon. RING H2 act as E3 ubiquitin-protein ligases and play critical roles In targeting 
10 the destruction of proteins of diverse functions in all eukaryotes, ranging from yeast to 
mammals. The Arabidopsis genome contains a large number of genes encoding RING finger 
protems. A small group is constituted by more than 40 RING-H2 finger proteins that are of 
small s,ze. not more than 200 amino acids, and contain no other recognizable protein-protein 
.nteraction domain(s). This type of genes is very important for several aspect of development 
15 regulation of developmental proteins, cell cycle proteins. 

At4900730 showing homology to Homeodomain AHDP (antocyaninless 2) 

This .s a homeodomain transcription factor; similar to ATML1 and is very conserved and has 
ep,demiis specific expression. This sequence shows also homology to Zee mays mRNA for 
^0 OCL3 protein (Ingram, G. C. et al. (2000)). 

At4g13940 showing homology to adenosylhomocysteinase (Glutathione dependent 
fomtaldehyde dehydrogenase) 

Glutathione-dependent formaldehyde dehydrogenase was described in Sakamoto A et al 
(2002), Arabidopsis glutathione-dependent formaldehyde dehydrogenase is an S- 
n-trosoglutathione reductase. S-Nitrosoglutathione (GSNO), an adduct of nitric oxide (NO) with 
glutathione, is known as a biological NO reservoir. Heterologous expression in Escherichia coli 
of a cDNA encoding a gfutathione-dependent formaldehyde dehydrogenase of Arabidopsis 
ha ana showed that the recombinant protein reduces GSNO. The identity of the cDNA was 
30 further confimied by functional complementation of the hypersensitivity to GSNO of a yeast 
mutant with impaired GSNO metabolism. This is the first demonstration of a plant GSNO 
reductase, suggesting that plants possess the enzymatic pathway that modulates the 
bioactivity and toxicity of NO. 

35 At4g35050 showing homology to WD40 MSI3 

Members of the MSI/RbAp sub-family of WD-repeat proteins are widespread in eukaryotic 
organ,sms and fomi part of multiprotein complexes that are involved in various biological 
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Th. Zea -^J'^ ^ Ras/cAMP pathway ,n yeast. The 

Determination of its exp^ssion pattern dunngma^e seed development re 

regulates G1 progression an RbAp46, share sequence 

r ;ito:Lt::^^^^^ P-way t.a. ^.ates oe„u,ar growth. Human 

...type as we,, as ripenin. in«or ^^^ZZZ':::^^: " - - 

to the human RB protein and the RB-like RKbi proie 

interaction is conserved between plants and animals (Ach et al. (1997)). 

At4g36670 showing homology to Sugar transporter , - u The ERD6 is 

re,ated to sugar transporter of bactena, yeas p hydrophilic 
.vea,ed that ERD6 protein Has 12 putatK,e ^-'^"'^^"^ '^'^^'^^^^ Imbrane-spanning 
.gion. Secuenees that a. consen,ed at the en s o, the 6th and 1 ^h m ^ 
domains of sugar transporters are aiso present ,n ERD6. ERD6 gen^ 
0 ™,tigene famiiy in the A« W genome. The expression o^ me ERD6 gene 
not oniy by dehydration but also by oold treatment (Kiyosue et al. (1998)) - 

.«,„«.O..o./„./,o,noM..oUp,--^^^^^^ ^ ^ , 

Nonspecific lipid transfer proteins (LTPs) from pianis ar ^,^,«.,<,o these proteins 

,3 s,muL Phospholipid transfer between membranes . ^-^-^^^^^^ '^'^Z^ 
sre generally located outside Cthe plasma membrane, ,t ,s unl^ely ha^ ^ 
vivo. The LTP1 promoter was active early ,n development .n protod 
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embryos, vascular tissues, lignified tips of cotyledons, shoot meristem, and stipules. In adult 
plants, the gene was expressed in epidermal cells of young leaves and the stem. In flowers, 
expression was observed in the epidermis of all developing influorescence and flower organ 
primordia, the epidermis of the siliques and the outer ovule wall, the stigma, petal tips, and 

5 floral nectaries of mature flowers, and the petal/sepal abscission zone of mature siliques. 
Consistent with a role for the LTP1 gene product in some aspect of secretion or deposition of 
lipophilic substances in the cell walls of expanding epidermal cells and certain secretory 
tissues! The LTP1 promoter region contained sequences homologous to putative regulatory 
elements of genes in the phenylpropanoid biosynthetic pathway, suggesting that the 

10 expression of the LTP1 gene may be regulated by the same or similar mechanisms as genes 
in the phenylpropanoid pathway (Thoma, S. et al. (1994)). More background knowledge to this 
type of genes can be found in the following references: Clark, A. M. et al., (1999); Toonen, M. 
A. et al. (1997); Molina, A. (1997); Thoma, S. etai. (1994). 

1 5 At5g02820 showing homology to SPO like 

Plant steroid honnones, such as brassinosteroids (BRs), play important roles throughout plant 
growth and development. Plants defective in BR biosynthesis or perception display cell 
elongation defects and severe dwarfism. Two dwarf mutants named binS and binS with 
identical phenotypes to each other display some characteristics of BR mutants and are 

20 partially insensitive to exogenously applied BRs. In the dark, bin3 or bin5 seedlings are de- 
etiolated with short hypocotyls and open cotyledons. Light-grown mutant plants are dwarfs with 
short petioles, epinastic leaves, short inflorescence stems, and reduced apical dominance. 
BINS and BINS were cloned and show that BINS is one of three putative Arabidopsis SP011 
homologs (AtSP011-3) that also shares significant homology to archaebacterial 

25 topoisomerase VI (TOP6) subunit A, whereas BINS represents a putative eukaryotic homolog 
of TOP6B. The pleiotropic dwarf phenotypes of binS establish that, unlike all of the other 
SP011 homologs that are involved in meiosis, BINS/AtSP011-3 plays a major role during 
somatic development. Furthermore, microarray analysis of the expression of about 5500 genes 
in bins or binS mutants indicates that about 321 genes are down-regulated in both of the 

30 mutants, including 18 of 30 BR-induced genes. These results suggest that BINS and BINS may 
constitute an Arabidopsis topoisomerase VI that modulates expression of many genes, 
including those regulated by BR? (Yin Y et al. (2002)). More background information on this 
type of gene can be found in th--. following references: Soustelle, C. et al. (2002); Kee, K. and 
■ Keeney, S. (2002); Hartung. F. and Puchta, H. (2001); Grelon, M. et al. (2001). 

35 
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At5g14420 showing homology to copine I (phospholipid binding protein) 

The copines are a newly identified class of calcium-dependent, phospholipid binding proteins 
that are present in a wide range of organisms, including Paramecium, plants. Caenothabditis 
elegans mouse, and human. However, the biological functions of the copines are unknown. A 
humidity-sensitive copine mutant was made in Arabidopsis and under non-pemiissive. low- 
humidity conditions, the cpn1-1 mutant displayed aberrant regulation of cell death that included 
a lesion mimic phenotype and an accelerated hypersensitive response (HR). However, the HR 
in cpn1^1 showed no increase in sensitivity to low pathogen titers. Low-humidity-grown cpn1-1 
mutants also exhibited morphological abnormalities, increased resistance to vimlent strains of 
Pseudomonas syringae and Peronospora parasitica, and constitutive expression of 
pathogenesis-related (PR) genes. Growth of cpn1-1 under permissive, high-hum.drty 
conditions abolished the increased disease resistance, lesion mimic, and morphological 
mutant phenotypes but only partially alleviated the accelerated HR and constitutive PR gene 
expression phenotypes. The disease resistance phenotype of cpn1-1 suggests that the CPN1 
gene regulates defense responses. Alternatively, the primary function of CPN1 may be the 
regulation of plant responses to low humidity, and the effect of the cpn1-1 mutation on disease 
resistance may be indirect (Jambunathan et al. (2001)). Arabidopsis growth over a wide range 
of temperatures requires the BONZA11 (BON1) gene because bon1 null mutants make 
miniature fertile plants at 22-C but have wild-type appearance at 28»C. The expression of 
20 B0N1 and a BON 1 -associated protein (BAP1) is modulated by temperature. Thus B0N1 and 
BAP1 may have a direct role in regulating cell expansion and cell division at lower 
temperatures. BON1 contains a Ca(2+)-dependent phospholipid-binding domain and is 
associated with the plasma membrane. It belongs to the copine gene family, which .s 
consented from protozoa to humans. The data here obtained suggest that this gene family 
25 may function in the pathway of membrane trafficking in response to external condrtions (Hua et 
al (2001)) The major calcium-dependent, phospholipid-binding protein obtained from extracts 
of Paran7ec«/m tetraurelia, named copine. had a mass of 55 kDa, bound phosphatidylserine 
but not phosphatidylcholine at micromolar levels of calcium but not magnesium, and promoted 
lipid vesicle aggregation. Current sequence databases indicate the presence of multiple cop.ne 
30 homologs in green plants, nematodes, and humans. The full-length sequences reveal that 
copines consist of two C2 domains at the N temiinus followed by a domain similar to the A 
domain that mediates interactions between integrins and extracellular ligands. The association 
with secretory vesicles, as well the general ability of copines to bind phospholipid bilayers in a 
calcium-dependent manner, suggests that these proteins may function in membrane trafficking 
35 (Creutzetal. (1998)). 
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At5g49160 showing homology to cytosine methyltransferase 

DNMT3L is a regulator of imprint establishment of normally methylated maternal genomic 
sequences. DNMT3L shows high similarity to the de novo DNA methyltransferases. DNMT3A 
and DNMT3B. however, the amino acid residues needed for DNA cytosine methyltransferase 
activity have been lost from the DNMT3L protein sequence. Apart from methyltransferase 
activity Dnmt3a and Dnmt3b serve as transcriptional repressors associating with h.stone 
deacetylase (HDAC) activity. DNMT3L can also repress transcription by binding d.rectly to 
HDAC1 protein. PHD-like zinc finger of the ATRX domain is the main repression motif of 
DNMT3L. through which DNMT3L recruits the HDAC activity needed for transcnptional 
silencing DNMT3L as a co-repressor protein and suggest that a transcriptionally repressed 
chromatin organisation through HDAC activity is needed for establishment of genomic .mpnnts 
(Aapola et al (2002)). More background information on this type of gene can be found in 
Chen T et al, (2002); Bartee. L. and Bender, J. (2001); Freitag M. et al. (2002). in Ambidops.s 
a SW12/SNF2 chromatin remodeling factor-related protein DDM1 and a cytosine 
5 methyltransferase MET1 is required for maintenance of genomic cytosine methylation. 
Mutations in either gene cause global demethylation. There are also effects of these mutations 
on the PAI tryptophan biosynthetic gene family, which consists of four densely methylated 
genes an-anged as a tail-to-tail inverted repeat plus two unlinked singlet genes. The 
methylation mutations caused only partial demethylation of the PAI loci: ddm1 had a strong 
0 effect on the singlet genes but a weaker effect on the inverted repeat, whereas met1 had a 
stronger effect on the inverted repeat than on the singlet genes. The double ddm1 met1 
mutant also displayed partial demethylation of the PAI genes, with a pattern similar to the 
ddm1 single mutant. To detemiine the relationship between partial methylation and expression 
for the singlet PA12 gene a novel reporter strain of Arabidopsis was constructed, in which PAI2 
silencing could be monitored by a blue fluorescent plant phenotype diagnostic of tryptophan 
pathway defects. This reporter strain revealed that intemiediate levels of n^ethylation correlate 
with intermediate suppression of the fluorescent phenotype. Other background infomiation can 
be found in Finnegan, E. J. and Kovac K. A. (2000). Plant DNA methyltransferases. DNA 
methylation is an important modification of DNA that plays a role in genome management and 
30 in regulating gene expression during development. Methylation is carried out by DNA 
methyltransferases which catalyse the transfer of a methyl group to bases within the DNA 
helix Plants have at least three classes of cytosine methyltransferase which differ in protein 
structure and function. The METI family, homologues of the mouse Dnmtl methyltransferase, 
- most likely function as maintenance methyltransferases. but may also play a role in de novo 
methylation The chromomethylases. which are unique to plants, may preferentially methylate 
DNA in heterochromatin; the remaining class, with similarity to Dnmt3 methyltransferases of 
mammals are putative de novo methyltransferases. The various classes of methyltransferase 
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mav show differential activity on cytosines in different sequence c . 

ra preferentialiy methy.ate cytosines in CpNpG sequences while the Ars.,.ops,s MET 
TeUransferase Shows a preference for cytosihes in CpG sequences. Additional prote.ns. ^ 
Taie DDM1. a member of the SNF2/SW.2 family of chror^atin remodel.ng prote.ns. are 
also required for methylation of plant DNA. 

;„5g54940 .^.,n, homo/ogy ,o Trans^on ,n«,a«on fe.tor W«ona, ,n«,a«o„ 

:i«sls .as no. .en cons... .o .e — ta. ,n .he con.., o, ..I 
However, da,a are emerging on .he involvemen. of .his process J^^^^ 
.umoriaenesis Pro.ein b,osyn.hesis is a central process in all liv.ng cells. I. is one ot .he 
r; r—sion o^geneuc in,or.3.,on s.orea in ONA on .he - 
are produced .o .a,n.ain .he speCc .unCion o, a given ce . ^^^^^^^^^^ 

p,a» on riboson^l pa,«Ces where ,he gene.. .n,orma.ion .ranscnbed 
into orcein The process of pratein synmesis on me ribosome cons,s.s of .hree phases^ 
r lo"a.i:n and .ermlnaHon. Brassinos.ero«s (BRs) regu,a.e - exPress.on o 
numerous genes assoc,a.ed ^ plan. aevelopn,en., and re<,u,,B ^^^^^j'^J^^ 
receptor .nase .o realize .heir effecte. In animals, *e .ransformrng 
t,eta ,an,i,v o, peptides aCs via Ser^r receptor Kinases .o have a 
Jm^,. involved in animal development and adult hon,eostas,s. TGF-bete recep.or 
rceln (TH,P-f ) was previously .own by o.ers .o be an '^-J^^J^^ 
me TGF.be.a .ype M recep.or Kinase which plays an impor.an. role ,n -f^-^-^^ ^^f^^^ 
TRIP 1 is a Wl>repea. protein *a. also has a dual role as an essenfal subun« of the 
IZZ inlon facor elP3 in animals, yeas, and p,an.s, -by^-- - ; 
pute.L linK between a developmen.al s«naling pa.hway and .he con.ro, of ^^'^^T^^Z 
I yeast expression of a TR,P-1 « has also been closely assocated w, h cen 
ro ^ln 1 progression through me eel, cyC. Transcrip. levels of ^^^^^^^^^ 
Plants are regulated by BR feaWtent under a variety of conditions, and transgen c plants 
plants are regula y ^ developmental defects, including 

expressing antisense TRIP-1 ffl, correla.ive 

' -::Z:^:ZZ1 :r repo.ed d., f.nc«ons ,n ver.ebra.es ^ 
Z mlOhtLiate some of the molecular mechanisms ^t,: 

L- o nenpffli role in development, association between 
5 eu™. This complex may have a g ne. role ,n d P ^ ^^^^^^ 

components of me COPS ^'^^^^ j^^'^^^ also as ,NT-6) and e,P3c 

eukaryotic translation initiation factor 3 {elF3). elP Je tp^o. 
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(p105). AtelFSe coimmunoprecipitated with CSN7, and elF3c coimmunoprecipitated with 
elF3e, e!F3b. CSN8, and CSN1. elFSe directly interacted with CSN7 and elF3c. elFSe and 
elF3c are probably components of multiple complexes and that elF3e and elF3c associate with 
subunits of the C0P9 signalosome, even though they are not components of the COP9 
signalosome core complex. This interaction may allow for translational control by the COP9 
signalosome (Yahalom et al. (2001)). 

At5g56740 showing homology to Histone acetyl transferase HATB 
Transforming viral proteins such as E1A which force quiescent cells into S phase have two 
essential cellular target proteins, Rb and CBP/p300. Rb regulates the G1/S transition by 
controlling the transcription factor E2F. CBP/p300 is a transcriptional co-activator with intrinsic 
histone acetyl-transferase activity. This activity is regulated in a cell cycle dependent manner 
and shows a peak at the G1/S transition. CBP/p300 is essential for the activity of E2F. a 
transcription factor that controls the G1/S transition. It was found that CBP HAT activity is 
15 required both for the G1/S transition and for E2F activity. Thus CBP/p300 seems to be a 
versatile protein involved in opposing cellular processes, which raises the question of how its 
multiple activities are regulated (Ait-Si-Ali, S. et al (2000)). The BRCA2 is a histone 
acetyltransferase. Two potential functions of BRCA2 were proposed which includes role in the 
regulation of transcription and also in DNA repair. Forty-five-amino acid region encoded by 
20 exon 3 of BRCA2 was shown to have transcriptional activation function. Recent studies of the 
several enzymes Involved in acetylation and deacetylation of histone residues have revealed a 
possible relationship between gene transcriptional activation and histone acetylation. Since 
BRCA2 appear to function as a transcriptional factor, Histone acetyl transferase (HAT) activity 
of BRCA2 was tested. Also, evidence that BRCA2 has intrinsic HAT activity, which maps to the 
25 amino-tenninal region of BRCA2, was presented. It was demonstrated that BRCA2 proteins 
acetylate primarily H3 and H4 of free histones. These observations suggest that HAT activity of 
BRCA2 may play an important role in the regulation of transcription and tumor suppressor 
function (Siddique et al. (1998)). These types of genes are very important for regulation of 
genes involved in development, cell cycle control, and chromatin structure. 



30 



35 



At5g61520 showing homology to STP3 sucrose transporter 

For developing seeds of grain legumes, photoassimilates released to the seed apoplasm from 
maternal seed coats are retrieved by abaxial epidermal and subepidemrjal cells (dermal cell 
complexes) of cotyledons followed by symplasmic passage to their underlying storage 
parenchyma cells. In some species, the cells of these complexes differentiate into transfer 
cells (e.g. broad bean and pea, Patrick and Offler,2001). Sucrose is a major component of the 
photoassimilates delivered to cotyledons (Patrick and Offler, 2001; Weber et al.. 1997b). 
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""Tlrter (SUT) genes have been doned. and funrtionaUy characterised as 
Sucrose transporter (SUl) genes na « /rci iti Wphf^r et al 

sucrose/H. symporters, from developing cotyledons of broad bean (VfSUTI, Weber et al., 
rind pea (PSSUT1, Tagederet al. 1999). SUTs and P-.,pe H.-ATPases have bean 

In t 1 ocaLlo plasn^a membranes of den.a, eel, comptexes In developing cotyledons 
shown to co-localize to p ^^^^ ^^^^ ^ 

; of broad bean (Harnngton et al., 1997, Weber et ai., „,.h„n,» calls but is 

2000). in contrast, for pea cotyledons, SUT Is also present in storage P— ^J^^ 
4.J less ac«e than SUT(s) localized to epidermal transfer ceils (Tegeder et ai., 1999). 
These type of genes are Important for seed filling. 

en—, .e. However, t^e p... 

rical f.c.,:ncs) of .ost COPKs remains ^^^^^^Z::^:^ 

:::,o: r r i^r ::rofZ c^^s j .vie^.. . 

ringllular and genomic technologies with gene«c and biochemica, approaches, he 
rrn^tlon Of COPKS provides a valuable opponun«y to '^^^^^ 

oaloium-slgnailng network (Cheng et a,., 2002). These type of genes are Important 
20 sqnaUng. 

M2a25970 Showing homology to KH RNA binding domain 

Zvic and Barta (2002) described RNA recognition met. (RRM) and ^^^^^ 
domain Ri^-blnding proteins from the flowering plan, .ra./dops. ^^^^"^^ZZ^Z 
,5 sp^d motifs are the RNA recognition motff (RRM) and the K ^o-lcgy KH dom a ^ 
Lidopsis genome encodes 196 RRM-containing proteins, a more complex set than found 
C— rriregans and O^sop* me/anogas<e.. .n addition, the genom 
"a KH domain proteins. Most of the .ra^ops. -^-^^^'l^^ 
classifled into structural and/or functional groups, based on s,m,lar*y w,th either known 
30 "n Lb«op./s proteins Approximately 50% o, A..«<^«s RRM— g - 
: Thave Obvious' omologues In metazoa, and for most of -^ ^-^-^^^^ 
o^olcgues of metazoan proteins, no experimental data exist to conflrm '"^^ 
tur^tio of most Ara./dops/. RRM proteins and of ail KH proteins ,s 
■ co^plexHy Of RNA-binding proteins In ..a.*ps,s, as evident .groups « 
35 and polv(A>binding proteins, may account for the obse.ed '^^'^^^^"'ZIm 
behveen plants and metazoa. The function of th,s type of genes ,s largely unknown. 
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be related to PUMILIO genes from DrvsophHa. Important for regulation of gene expression at 
the post-transcriptional level, role In development, stress tolerance. 



At3g07800 showing homology to Thymidine Idnase 

5 This type of thymidine kinase genes is cell cycle regulated, E2F regulated, is responsible for 
production of thymidine triphosphate. This type of gene plays a role as a precursor for DMA 
synthesis and is therefore a marker of S phase. 

At5g47?70 showing homology to Homeobox leucine zipper protein 
10 This type of homeobox genes is important for development and growth and also for stress 
tolerance. At5g47370 is homeobox-leucine zipper protein HAT2 (HD-ZIP protein 2). Homeobox 
genes are known as transcriptional regulators that are involved in various aspects of 
developmental processes in many organisms. Homeodomain transcription factors regulate 
fundamental body plan of plants during embryogenesis, as well as morphogenetic events in 
15 the shoot apical meristem (SAM) after embryogenesis. HOX1 belongs to the subset of 
homeodomain leucine zipper (HD-zip) and is involved in the regulation of vascular 
development (Scarpella et ai, 2000; Meijer et a!.. 2000). The sequences for the rice OsHOXI 
orthologue are deposited in Genbank under the accession number X96681 (cDNA) and 
CAA65456.2 (protein), which sequences are both herein incorporated by reference. 

20 

BAA23337.1 OsMYBI 

MYB-like DNA binding proteins are involved in the control of specific developmental steps in 
different organs. OSMYB1 binds to a seed specific element in the seed storage protein 
glutelin, is expressed in endosperm of rice seeds, and plays an important role during seed 
25 maturation (Suzuki et al. , 1 997). 

BAA89798 OsNAC4 

NAC domain containing genes, such as NO APICAL MERISTEM in petunia and CUP- 
SHAPED COTYLEDON2 and NAP in Arabidopsis, have crucial functions in plant development 
30 (Kikuchi et a!., 2000). These genes are involved in the control of organ primordium delimitation 
and lateral organ development. It has also been recently shown that a member of the NAC 
family of transcription factor can induces fonnation of ectopic shoots on cotyledons (Daimon et 
al., 2003). 

3 5 AAD37699 rice homeodomain leucine zipper protein H0X6 (partial) 

Homeobox genes are known as transcriptional regulators that are involved in various aspects 
of developmental processes in many organisms. Homeodomain transcription factors regulate 
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fundamental body plan of plants during embryogenesis. as well as morphogenetic events in 
the shoot apical meristem (SAM) after embryogenesis. HOX6 is a homologue of the 
Arabidopsis homeobox gene Athb-12 (Lee et a/.. 2001). Athb-12 is a transcriptional activator 
important in regulating certain developmental processes as well as in the plant's response to 
water stress^ involving ABA-mediated gene expression. At3g61890 Is the Arabidopsis 
sequence corresponding with the rice HOX6 sequence of AAD37699. 

AK104073 OsMYB predicted 

This gene is homologous to the Arabidopsis gene CIRCADIAN CLOCK ASSOCIATED (CCA1) 
gene that encodes a related MYB transcription factor, which regulates circadian rhythms 
(Carre et Kim. 2002). This gene as well as the MYB homologue. regulate the period of 
circadian rhythms in gene expression and leaf movements. 

gyamo/e 9: NMR study of P^Fa/DPa overaxpressinq plants. 

In support of the microarray studies identifying the increased or decreased expression level of 
E2F-target genes in E2Fa-DPa overexpressing plants, the effects of E2Fa/DPa overexpression 
on the protein level and ultimately on the level of metabolites were studied via the techniques 
of metabolomics. Metabolomics means qualitative and quantitative analysis of the metabolites 
present at a certain time in a cell culture or a whole biological tissue. Metabolites, as 
designated here, are small molecular weight molecules (typically under 1000 Daltons). of 
which many are already known (such as urea, lipids, glucose or certain small homiones) while 
others are still to be identified. Metabolites are the final product of the protein content of the 
cell The main methods used to detect and quantify of those molecules are mass spectrometry 
or NMR spectroscopy (Nicholson et a!., 2002) after extraction and purification of the 
metabolites from the organism. 

Now NMR spectroscopy on whole organisms has been performed. The recording of spectra of 
the metabolites was possible without any prior purification of the plant material. Hereto, the 
samples were spun at the magic angle. This technique, dubbed "High Resolution Magic Angle 
Spinning" (HRMAS) NMR. has now been used on intact plantlets. 1H-13C HSQC spectra were 
recorded on intact wild-type and E2Fa/DPa overexpressing plantlets of Arabidopsis thal.ana, 
and monitored the changes in metabolite pattern. From the spectra, a shift in the metabolome 
of E2Fa/DPa overexpressing plants when compared to wild-type plants, was observed. These 
spectra are processed in order to map the observed metabolic differences. 
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WO 2004/035798 . . . .^.u:^^,^:^!^ pl^ n*^ fransformed With 
Pyamn/e 10: molecular and p hi^noti/Dic analysis of Arabidopisis pianisj i 

the genes pccordina to t he present invention 

Arabidopsis miiana plants are transformed with at least one of the genes of the present 
invention as presented in Table 4 or 5. operably linked to a plant promoter. 
In one example. Arabidopsis plants were transformed with the genes as presented in Tabte 6^ 
The vectors used ware derived from the expression vector pK7WGD2. carrying the CaMV35S 
promoter for expression of the gene. For transformation, the flower dip method described by 
Bechtoid and Pelletier (1998) was used. 



Table 6: Genes that were selected and transfomied into Arabidopsis 
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12 


At5g56740 


Histone acetylase HAT B 


348 + 349 


OK 


OK 


OK 


OK 


13 


At3g24320 


Putative mismatch binding 
protein 


350 +351 


OK 


OK 


OK (13,4) 


OK 


14 


At4g00730 


Antliocyaninless2 


352 + 353 










15 


At1g23O30 


arm-repeat containing protein 


354 + 355 


OK 


OK 


OK(clone11) 


OK 


16 


At5g54380 


receptor-protein i<inase-like 
protein 


356 + 357 










17 


At1g72180 


putative leucine-rich receptor- 
like protein l<inase 


358 + 359 


OK 


OK 


OK 


OK 


18 


At1g61100 


Unknov\/n 


360 + 361 


OK 


OK 


OK (18,1) 




19 


At2g25970 


Unknown 


362 + 363 


OK 


OK 


OK 


OK 


20 


At2g38310 


Unknown 








OK 


OK 


21 


At3g45970 


Unknown 


366 + 367 


OK 


OK 


OK (21,3) 


OK 



Code: internal reference code of the gene; AGI: accession number of the protein in the internal 
dataset, here with reference to the MIPS database accession number, Gene: name of the 
protein; primers: PCR primers used to isolate the ORF of the gene by RT-PCR using cDNA; 
5 prepared form E2Fa-DPa overexpressing plants; PCR: PCR completed successfully; 
pDONR207: cloning in this vector completed (www.invitrogen.com); pK7WGD2: cloning of the 
genes in the vector under control of the CaMV 35S promoter (Karimi et ai, Trends Plant Sci. 
2002 May;7(5):193-5): Flower dip: transfonnation of Arabidopsis plants with the pK7WGD2 
vector 

10 

The transformed Arabidopsis plants are evaluated as described below. 

After molecular analysis (PCR. RT-PCR, Western-blot, southern-blot, Northern blot, NMR), the 
plants with modified E2F target gene expression levels, are submitted to phenotypic analysis. 
Special attention is given to root growth and leaf development. 

15 

The root of A. thaliana. which has a rather constant diameter and rather uncomplicated radial 
symmetry, is a perfect model system for studying and determining the effects of modulation of 
expression levels of an E2F-target on an intact, growing tissue. 

20 The root of A. thaliana comprises a thick unicellular layer of the epidermis cells, one of cortex 
cells, one of endodermis cells and one of pericyclus cells that circumvent the vascular tube. 
Because of its transparency, the root of A. thaliana. these cellular layers can be visualized by 
interference contrast microscopy. By this means the origin of the cells in a specific cell layer 
can be traced back to a set of dividing mother cells in the meristem (Dolan et al., 1993). By 

25 measurement of the cell length of a specific cell layer in function of the distance to the root tip. 
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and the rate of movement of the cells away from the root tip (measured via time-laps 
photography), it is possible to determine the contribution of both the cell elongation as well as 
of cell division to the total root growth (Beemster and Baskin, 1998). 

5 The effects of the E2F-target overexpression in the leaves is determined via microscopic 
techniques after clearance of the leaves of lactic acid. This analysis is performed on the first 
developed leaf pear, since this leaf pear is most comparable between different plants. By 
measurement of the cell number and the number of epidermal cells at different time points 
during leaf development, it is deduced when the leaf cells stop to divide, when they start to 
10 differentiate, the duration of their cell cycle is, and their final cell size (De Veylder et al., 2001 a 
and b). Moreover, this method allows the analysis of the effect of E2F target overexpression on 
the formation of stomata. 



The effect of the E2F-target overexpression is also studied via biochemical means. Functional 
15 assays are developed for the specific enzymatic activity of the studied E2F-target gene. These 
functional methods are based on expression of a reported gene in case the E2F-target is in 
itself a transcription activator or repressor. Functional assays are based on the incorporation of 
radioactive nitrogen or radioactive carbon or other radiolabelled metabolites when the enzyme 
is Involved in the nitrogen or carbon metabolisms or other processes involving metabolites. By 
20 the comparison of the incorporated radioactivity between the control line and the transgenic 
line, the enzymatic activity of the E2F-target can be measured. 

Functional assays are based on the incorporation of radioactive ATP, radioactive purines or 
pyrimidines when the enzyme is involved in DNA replication and/or modification. Functional 
25 assays are based on labeled carbohydrates when the enzyme is involved in cell wall 
biogenesis, or ATP when the enzyme is fnvolved in processes of the chloroplast, or calcium 
when the enzyme is involved in signal transduction. 

In A. thaliana, besides the mitotic cell cycle also an alternative cell cycle is observed, in which 
30 DNA is replicated in the absence of mitosis or cytokinesis. This so-called endoreduplication 
process occurs often in plants. Until today, the physiological significance of endoreduplication 
is unknown. Possibly, it is a mechanism to increase the number of DNA copies per cell, which 
allows more transcription. In support of this hypothesis, endoreduplication often occurs In cells 
- with high metabolic activity (NagI, 1976). However, as a consequence of endoreduplication the 
35 cells are bigger, which is especially useful for increasing yield of cytoplasmatic component, for 
example storage proteins of the seed cells. 
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TO IdyTrcs Of EaF-ta^et overexpression on .he process of endoreduplloaflon .He 
DNA oonten, of ,he oon.ro, p.an^ and .ha transgenic plan, is measured v,a flow-oy^on^e.^. A 
Z dliled analysis is ob.ained by measuring U,e DNA con.en. of individual cells colored 
: :ONA.™ Lorochrome (e.g. DAPI). THe in.ns«y of .he color of .he r,uc eu , 
5 proportion wim Hs DNA con.en.. Re,a«ve DNA.n,easuren,en.s ^" ^^^^^^ 
Lcrodensi.on.e.er. This technique allows de.en.ining a epedfic .ssue .he endo,«^up„ca.on 
pattern of the transgenic plants. 

c^^mn/n n - Use of the invt»nfion in corn 

present invention as presented in Table 4 or 5, for example a gene selected ^^^^^'^ 
2 or a gene selected from the group described in Example 8, or a gene selected from the 
™ppLntedinTableBor7,orahomoiogue thereof such as for exampiea^.^^^^^ 

or a L ortholog, is cloned under control of a promoter operable ,n ma.e ,n a_^ia^ 
15 uansfomiation veCor sui.ed for Ag«,l>ac(erium-media.ed ,ransfom,at,on of com. The^ 
" m i designed for overexpression or for downreguiation. In --es ;f — 
genes selected from Table 5 (downregulated in E2Fa/DP transgen,cs) are o-rexpress d n 
Isgenic com and genes selected from Table 4 (upregu,a.ed in E2Fa-DPa °v-« 
; J) are downregu,a.ed in .ransgenic com. Sui.ab,e promCer for dnv,ng expression of *e 
20 genes of *e present invenSon are as presented in Tables I, II, III and IV or ,n Table V. 

SuLsble promoter for driving expression of the genes of the present Inversion in com are the 
: OOS2 promoter or any other promoter as men«oned herein above, " 
eUlon Of one or more E2F targets according to the present invention are ^^"^^TZ 
v^rs such as the pPZP vector described in Hajdukiewicz et al. ((1994) Plant Mol B,ol 2S. 
" ~oTa s^pelary vector. Vectors and methods .o use .g™.a.o„— .ed 
irLion of ma^e have been described in literature (Ishida e, a/., Nat B,o.echno,. 1996 
"; 45.50-. Frame e, a,., Plant Physiol. 2002 May,129(1):13-22) and are ere 
Zrpoid by reference. Transgenic plants made by these - J" 

30 greenhouse for T1 seed producUon. Inheritability and copy number of •'^^ 1" 
c Ld by cuantnative real-time PCR and Southern blot analysis and express,on lev^s of t 
Zene are detennlned by reverse PCR and Northern analysis. Transgenic ^^^^J^^^J^^ 
copy nsertions of the transgene and with varying levels of transgene ^^^^^^^^^^^^'^ 
. Jt2 seed production. Progeny seeds are genninated and grown ,n the 9'-"^° ^ " 
35 conditions we,, adapted for maize (16:8 photoperiod, 26.28.C ''^^^ '^^^J^^^ l^^^, 
24-C nighttime temperature, as well under water-deficient, nSrogen-defiaent, nd excess NaC 
conditions. Nul, segregants from .he same parenU, line, as well as wild .ype plan.s of the same 
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WO 2004/035798 coifina Of the crosses are 

eu«var are used as con.o,s. T.e progeny ^ ^nrdel^^^^^^ p„. 

evaluated on dWerent growth parameters, such as b o™^ nd de p 

the objective of transfemng the fieW-valldated transgenic trarts ,r,to commerc ^ P 
containing loci) fro. one gennplasn, into another. '^^-^^J^^ J^,^, 

cold). 

,n a particular exan,ple of the present invention, the genes as ,dent,f ed 
, or^h o^oiogue another ^--^ ^ 70,;^ t'oi: "^^^^ 

::::rrrrrr:rt: a..oter tor ove..ession or 
25 downregulation of these genes. 

. r ci:: iZtense orientation and for downregu,a«on a — — ; 

30 wesie, e. a,, ,2001, is .ade. Other ^^^^ - 

^„esareotherconstit.vepro.ote..u ha r^ ^ 

(high mobilKy group protein), or -^^"^ '^^'^^^Z^, p,,3,„. ,„,ention in nee, such as 
promote,, are used to dnve ^^^^^^-^^^ ^^^1°^. J„„„o„e,n), or vegetative tissue 
. for example P^--'-^^;* "^r rRC^^TS ip.asmio malate deshyd^genase); or 

rrrwriro h: p«01.5.. BAB.; or the .ho,e seed (PROSe. proteinase 
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inhibitor), or any other promoter described herein above. Tlie vectors used are plant 
transformation vector suited for /\gro/jacfenum-mediated transformation of rice, such as for 
example binary vectors of the pCAMBIA type or super binary vectors. Such vectors and 
methods for rice transformation have been described in literature by Aldemita and Hodges 
5 (1996) Chan^ef a/. (1993), Hiel et al. (1994) or in EP1 198985 and which teachings herein 
incorporated by reference. 



Table 7: genes (presented by their encoded proteins) selected for rice transformation 



TCDS3435 NP_1 76081.1 At1g57680 (Arabidopsis) 
MPLTKLVPDAFGWTICLVALLVLLGLLCIAYSFYFQSHVRKQGYIQLGY 
FqnPWIIRITFIL FAIWWAVGEIFRLSLLRRHRRLLSGLDLRWQENVCKW 
YIVSNLGFAEPCLFLTLMFLLRAPLKMESGALSGKWNRDTAGYIILYCLP 
MLALQLAWLSESRLNGGSGSYVKLPHDFTRTYSRVIIDHDEVALGTYP 
LLSTILLGVFAAVLTAYLFWLGRQILKLVINKRLQKRVYTLIFSVSSFLPLR 
IVMLCLSVLTAADKIIFEALSFLAFLSLFCFCWSICLLVYFPVSDSMALRG 
LRDTDDEDTAVTEERSGALLLAPNSSQTDEGLSLRGRRDSGSSTQERY 

VFI SLFLEAEN 


SEQ ID NO 1 

and 2 


>CDS3436 BAC42858.1 At3g45730 (Arabidopsis) 

MELPSPYSSRKEESTVPPKRGRVKIMIFRDLVRSETSMAPTPRRGRIKK 
MIAGDLVGSGKQNNYDGDGKRGG 


SEQ ID NO 3 
and 4 


>CDS3449 BAA23337.1 OS MYB1 (Rice) 

MGRSPCCEKAHTNKGANATTKEEDQRLIAYIRAHGEGCWRSLPKAAGLL 

RCGKSCRLRWMNYLRPDLKRGNFTDDEDELIIRLHSLLGNKWSLIAGQL 

PGRTDNEIKNYWNTHIKRKLLARGIDPQTHRPLLSGGDGIAASNKRHHR 

RRIPYP<5RRRRRPRRSSPCEAAAAAAPGRLLGRRLPQQQRHNEHGGA 

AVPRPQPRALGRADAELAAGGDAHQRAAGLPLLPPRLPRRGGVQLSG 


SEQ ID NO 5 

and 6 


>CDS3448 BAA89798.1 OsNAC4 (rice) 

MAAAVGGSGRRDAEAELNLPPGFRFHPTDEELWHYLCRKVARQPLP 

VPHAEVDLYKLDPWDLPEKALFGRKEWYFFTPRDRKYPNGSRPNRAA 

GRGYWKATGADKPVAPKGSARTVGIKKALVFYSGKAPRGVKTDWIMH 

EYRLADADRAPGGKKGSQKLDEWVLCRLYNKKNNWEKVKLEQQDVA 

SVAAAAPRNHHHQNGEVMDAAAADTMSDSFQTHDSDIDNASAGLRHG 

GCGGGGFGDVAPPRNGFVWKEDNDWFTGLNFDELQPPYMMNLQHM 

QMQMVNPAAPGHDGGYLQSISSPQMKMWQTILPPF 


SEQ ID NO 7 
and 8 


>CDS3447 AAD37699.1 OS Homeodomain leucine zipper protein HOX6 
(rice) 

MDGEEDSEWMMMDVGGKGGKGGGGGGAADRKKRFSEEQIKSLESM 


SEQ ID NO 9 
and 10 
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FATQTKLEPRQKLQLARELGLQPRQVAIWFQNKRARWKSKQLEREYS/\ 

LRDDYDALLCSYESLKKEKLALIKQLEKLAEMLQEPRGKYGDNAGDDA 

RSGGVAGMKKEEFVGAGGAATLYSSAEGGGTSSTEQTCSSTPWWEF 
ESE 




>CDS3446 AK1 04073 OSMYB predicted (rice) 

MASIVTATVAAASAWWATQGLLPLFPPPIAFPFVPAPSAPFSTADVQRA 

QEKDIDCPIVIDNAQKELQETRKQDNFEAMKVIVSSETDESGKGEVSLHT 

ELKISPADKADTKPAAGAETSDVFGNKKKQDRSSCGSNTPSSSDIEAD 

NAPENQEKANDKAKQASCSNSSAGDNNHRRFRSSASTSDSWKEVSE 

EGRLAFDALFSRERLPQSFSPPQVEGSKEISKEEEDEVTTVTVDLNKNA 

AIIDQELDTADEPRASFPNELSNLKLKSRRTGFKPYKRCSVEAKENRVP 
ASDEVGTKRIRLESEAST 


SEQ ID NO 11 
and 12 


>CDS3445 NP_565887.1 At2g38310 (Arabidopsis) 
MLAVHRPSSAVSDGDSVQIPMMIASFQKRFPSLSRDSTAARFHTHEVG 
PNQCCSAVIQEISAPISTVWSWRRFDNPQAYKHFLKSCSVIGGDGDNV 
GSLRQVHWSGLPAASSTERLDILDDERHVISFSWGGDHRLSNYRSVT 

TLHPSPISGTWVESYWDVPPGNTKEETCDFVDViVRCNLQSLAKIAEN 
TAAESKKKMSL 


SEQ ID NO 13 


>CDS3444 NP_565703.1 At2g30590WRKY family transcription factor 
(Arabidopsis) 

MEEIEGTNRAAVESCHRVLNLLHRSQQQDHVGFEKNLVSETREAVIRF 

KRVGSLLSSSVGHARFRRAKKLQSHVSQSLLLDPCQQRTTEVPSSSSQ 

KTPVLRSGFQELSLRQPSDSLTLGTRSFSLNSNAKAPLLQLNQQTMPP 

SNYPTLFPVQQQQQQQQQQQQQEQQQQQQQQQQQFHERLQAHHL 

HQQQQLQKHQAELMLRKCNGGISLSFDNSSCTPTMSSTRSFVSSLSID 

GSVANIEGKNSFHFGVPSSTDQNSLHSKRKCPLKGDEHGSLKCGSSSR 

CHCAKKRKHRVRRSIRVPAISNKVADIPPDDYSWRKYGQKPIKGSPYPR 

GYYKCSSMRGCPARKHVERCLEDPAMLIVTYEAEHNHPKLPSQAITT 


SEQ ID NO 15 
and 16 


>CDS3443 NP_849867.1 At1g69510 (Arabidopsis) 
MEDVKGKEIIDDAPIDNKVSDEMESEENAIKKKYGGLLPKKIPLISKDHE 
RAFFDSADWALGKQKGQKPKGPLEALRPKLQPTPQQQPRARRMAYSS 
GETEDTEIDNNEAPDDQACASAVDSTNLKDDGGAKDNIKS 


SEQ ID NO 17 
and 18 


>GDS3442 NP_564615.3 At1g52870 (Arabidopsis) 

MAAASLHTSISPRSFLPLSKPSLKPHRSQILLRNKQRNCVSCALIRDEID 

LIPVQSRDRTDHEEGSVWIVISTETAVDGNESNAA/GFSAATSEGQLSLE 

GFPSSSSSGADLGDEKRRENEEMEKMIDRTINATIVLAAGSYAITKLLTI 

DHDYWHGWTLFEILRYAPQHNWIAYEEALKQNPVI^KMVISGNAA'SVG 


SEQ ID NO 19 
and 20 
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DWIAQCYEGKPLFEIDRARTLRSGLVGFTLHGSLSHFYYQFCEELFPFQ 
DVWVWPVKVAFDQTVWSAIWNSIYFTVLGFLRFESPISIFKELKATFLPM 
LTAGWKLWPFAHLITYGLVPVEQRLLWVDCVELIWVTILSTYSNEKSEA 

RISESVIETSSSSTTTIDPSKE 




>CDS3441 NP_849293.1 At4g02920 (Arabidopsis) 

MIKLCFMTSHGYSIPGLGLPQDLCNTEIiKQNSRSHLVNPGARQEIIPAS 

SFNLNTELLEPWKPVSSFSQFVEIDSAMMKPLLMDVHETAPESLILSFGI 

ADKFARQEKVMEFLLSQSEEFKEKGFDMSLLNELMEFESMKSSSQLRP 

YDTSSVLYLNQELGKPVLDLVRDMMENPEFSVRSNGHVLFSSSSNPEL 

NDLLSIASEFNLSRNSTTKWRQLSPLIPHFQRFESDVFTPAKLKAVTVLA 

PLKSPEKSRLKSPRKHNTKRKAKERDLYKRNHLHAYESLLSLMIGNDH 

RHKHTTVLSLQKSCGELSELLTQFSITAAGTGIAVLFSWCSLASRRVPF 

rANKFFDTGLGLSLVILSWAVNRLREVIVHVNRKANKPCSSLKDDEIlNS 

VERSMKEVYYRAATVIAVFALRFAC 


SEQ ID NO 21 
and 22 


>CDS3440 AAM91 100.1 At1g45200 {Arabidopsis) 

MSKTNMKFCNSYFLVDPTKASFLDLLLLLFSSNLTSARFIDSPPDTLKGF 

RRSFASRWILALAIFLQKVLMLLSKPFAFIGQKLTYWLNLLTANGGFFNLI 

LNLMSGKLVKPDKSSATYTSFIGCSDRRIELDEKINVGSIEYKSMLSIMA 

SKISYESKPYITSWKNTWKMDLVGNYDFYNAFQESKLTQAFVFKTSST 

NPDLIWSFRGTEPFEAADWCTDLDLSWYEMKNVGKVHAGFSRALGL 

QKDGWPKENISLLHQYAYYTIRQMLRDKLGRNKNLKYILTGHSLGGALA 

ALFPAILAIHGEDELLDKLEGIYTFGQPRVGDEDFGEFMKGWKKHGIEY 

ERFVYNNDWPRVPFDDKYLFSYKHYGPCNSFNSLYKGKVREDAPNA 

NYFNLLWLIPQLLTGLWEFIRSFILQFWKGDEYKENWLMRFVRWGIVF 

PGGSNHFPFDYVNSTRLGGLVRPPPTTTPEDKLALIA 


SEQ ID NO 23 
and 24 



Transgenic plants generated by these rice transformation methods are evaluated for various 
growth characteristics. More particularly, the transgenic plants are evaluated and the following 
parameters are monitored: increased total above ground biomass, increased plant height, 
increased number of tillers, increased number of first panicles, increased number of second 
panicles, increased total number of seeds, increased number of filled seeds, increased total 
seed yield (weigth) per plant, increased han/est index, increased thousand kernel weight, 
increased Tmid, increased T45 or A90, increased A42, changed cycling time or an changed 
growth curve, changed flowering time. 

Plants with increase biomass, increased organ number and/or size (including seeds) and or 
any other economically attractive growth characteristics as found by the following plant 
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eva^,::", a. s.ao.ed .o — g the — -ns into —a, 
germplasm. 

. ......n. for TO. T1 ,MT2i»^-ng ^n'- 

:ir: — r .... c... o.a... . a ..n^use ...^^^^ 

. . s.^, e . ^^^^^^^^^^^^^^ 

IIZ Lng the t^nsgena Mlzygotes), a. selected hy ™on«o„ng screena^e 

Terrwlth Improved ag™no™,ca, para.ate. ,n T1 a.e chosen —-on In T2 
. SeeaLches .o. .e pos^ve plants (^th ^^^^ :;^^^^i:Z 

:ro:r:e:=^^ 
rsre:::::::=^^^^ 

..r^.d 60 Dlants per event of which 30 positives for the transgene, and 30 negative, 
are selected, 60 pianis pei cvcin w veaetative growth 

T1 and T2 plants are transferred to the greenhouse and evaluated for vegetative 

parameters and seed parameters, as described hereunder. 



c;t:,fi^f.cat anal>f«^is- t-test and F-test w thP overall evaluation 

^^^^^^^^^^^^^^^ - 

of plant phenotypic characlenstics. An F-test is camea oui y 

31, 0, the plants of a,l of the evenU .ansformed wKh the gene of .ntare t. The F e . 

ou. to checK for an effect of the gene over all «,e .ransfor,.a.on ^'^'^^'^ '^'^J^^^^ 

, tt^M« Qinnifirant data as determmed by tne vaiue 

more than the presence or position of the gene, in me 
significance for a global gene effect is set at a 6% probability level. 

TO ohec. for an effect o, the within an event, i.e., for a ^^^^^^^^^^^ 
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rTflSk probability level. Within one population o. .ransfom,atlon events, some 
t^est IS set at 1 0 ^ probability i- based on the hypothesis that a gene 

events can be under or above this t-tes. thresholdjhis is ba e, yp 

„i3ht only have an e«eC in certain positionsin ^^^^^J^X ^ referred to as 
,„,ion-aependente«ect.n^^^^^^^ 

a line effect of a gene . ^ ,,e F-dls.nbu.lon. The p-va,ue stands for .he 

or alternatively, by oompanng the F-value to the transgene") 
p^bability .ha. .he null hypothesis (null hypothesis being there is no effect 



is correct. 
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20 



VegBtative growth measurements: ^ ^ 

The selected transgenic plants are grown in a ^^^^/^^^^^^^^^^^^ p,3nt. The 

selected .ransgenic plants are grown on soil iu 

environmental setBngs: pho.opehod= 11.5 h, daylight inlens^- ^^ ^"J- " 

.e.perature= 28-C or higher, nigh, .ime ^-P^™'"-" ^ side a. random 

Transgenic plan.s and «,e corresponding -'7^°-;:,^;::; ;; 3 passed several 

of all the plants, using image analysis software. 

„ ;^ve ground plan, area is detem,ined by counting ^ --^J^^^^^^ 
3boveground plant parts disCminated Jrom the '^^^ ^^ J, ,,,3ical 

pi^ures UKen on .he 7^ ^^^^^l^U^^^^ 
ciirfape value expressed in square mm by caiiDrauon. tAH 
pleasured thlsv^ycorrelateswi*^ 

„ P,an. heigh, is de.em,ined by .he distance between the 
cut off at 3 cm above the pot nm. They were tnen co 



uui UM ai »7 — — 

together in the same sheet were counted as one tiller 
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(d) Number of primary panicles. The tallest panicle and all the panicles that overlap with the 
tallest panicles when aligned vertically are counted manually, and considered as primary 
panicles. 

(e) Number of secondary panicles. The number of panicles that remained on the plant after the 
harvest of the primary panicles are counted and considered as secondary panicles. 

(0 Growth curve. The plant area weekly measurements are modeled to obtain a growth curve 
for each plant, plotted as the value of plant area (in mm^) over the time (in days). From this 
growth curve the following parameters are calculated. 

(g) A42 is the plant area at day 42 after sowing as predicted by the growth cun/e model. 

15 (h) Tmid is the time that a plant needs to grow and to reach 50% of the maximum plant area. 
Tmid is predicted from the growth curve model. 

(i) T90 Is the time that a plant needs to grow and to reach 90% of the maximum plant area. 
T90 is predicted from the growth curve model. 
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Seed-related para meter measurements 

The mature primary panicles of T1 and T2 plants are han^ested. bagged, barcode-labelled and 
then dried for three days in the oven at 37«C. The panicles are then threshed and all the seeds 
were collected and counted. The filled husks are separated from the empty ones using an air- 
blowing device. The empty husks are discarded and the remaining fraction is counted again. 
The filled husks are weighed on an analytical balance. This procedure resulted in the set of 
seed-related parameters described below. 

(a) Total seed number per plant is measured by counting the number of husks harvested from 
a plant. 

(b) Number of filled seeds is determined by counting the number of filled husks that remained 
after the separation step. 

(c) Total seed yield per plant is measured by weighing all filled husks harvested from a plant. 

(d) Harvest index of plants is defined as the ratio between the total seed yield and the above 
ground area (mm^), multiplied by a factor 10®. 
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(e) Thousand Kernel Weight (TKW) of plants is a parameter extrapolated from the number of 
filled seeds counted, and their total weight. 

(f) TotalArea Emergence Prop, is the time when plant reach 30 % of their maximum total area 

(g) TotalArea Cycle Time, /s the time when plant reach 90 % of their maximum total area 

Further molecular analysis is performed on the positive plants by techniques well known by 
the person skilled in the art such as for example RT-PCR. 
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. Arabidopsis Genes 2-fold or more upregulated in E2Fa/DPa plants 



Gene Identification 


accession % 


MIPS name 


OLD 

cDN/ 


REF 

\ PRO 


SEQ 

r cdn; 


ID NO 

^ PRO- 


Fold 

induction 

r 


E2Fsite 


Plant homologue 


Unknown function (1^ 
hypothetical protein 


AI998042 


At1g57680 


1 


53 


433 


434 






nee BAB901 59.1, 
mai7P AYin797n i 


putative protein 


A!994686 


At3g45730 


2 


54 


231 


232 


5.14 






putative protein 


AI994734 


At5g66580 


4 


56 






<j. 1 0 






unl<nown protein 


AI999397 


At2g38310 


5 


b/ 


995 


996 


2.79 


TTTGCCC 
G 


rice BAB681 02.1 


unknown protein 


AI995465 


At2g47440 


7 


59 


931 


932 


2.50 






unKnown protein 


AI994871 


At1g76970 


8 


60 


1193 


1194 


2.34 




riceBAB78689.1, 

uUIII rVMDUUU/a. I 


hypothetical protein, 

kinesin 


AI998366 


At1 927500 


9 


61 


393 


394 


2.21 




riceAAL87057.1 


putative protein 


"AI996967 


At4g33050 


10 


62 


383 


884 


2.20 




rice BAB90008.1 


putative protein 


Ai995917 


At3g43690 


12 


64 


C\J\J 




9 -IQ 
<1. 10 






unknown protein, kh 
domain protein 


AI9g3084 


At2g25970 


13 


65 


941 


942 


2.15 




rice BAA9291 0.1, 
maize AY1 06526.1 


unknown protein 


AI993077 


At1 968580 


14 


66 


937 


938 


2.13 




riceBAC00723.1, 
com AAK11516.1 


putative protein, copine 
hvDothetical orotpin 


AI993019 


At5g14420 
All 957990 


15 
16 


67 


205 


206 


2.05 




rice BAB92575.1 


unknown protein - 


Ai997827 


At5g53740 


17 


68 
69 


415 
2/31 


416 
2732 


2.02 
2.01 




rice BAB90042.1 


DNA replication and 
modification (14) 




















putative thymidine 
kinase 


AI997851 


At3g07800 










8.44 




rice AAC31 168.1 


UNA methyltransferase 


AI994691 


At5g49160 










5.37 


ATTGCGG 

C 


riceAAL77415.1, 
com AAC1 6389.1 


IviSiS 


AW004204 


At4g35050 










4.89 


nrcccG 

r\ 

Lf 


:om AAL33648.1 


putative linker histone / 
protein 


M994590 / 


^t3g18035 










3.31 






putative reolication i 
factor c 


UQQ7Q04 i 


^t1g21690 










J.30 

( 


nrcccG 




topoisomerase 6 (■ 
subunit A 


U995290 / 


^t5g02820 








i 


?.62 1 
C 


rrrcccG 




histone H4-like protein /= 


J999171 I- 


^t3g46320 










!.55 1 

c 


mGGGG 




histone acetylase HAT A 
B 


^1998229 /> 


d5g56740 








2 


.36 1 

C 


TTCGCG c 


omAA!\/128228.1 


putative histon H1 A 
histone H2A-like protein A 


1996137 A 
1995882 A 


t1 906760 
t4g2723C 








2 


.27 






putative DNA gyrase A 
subunit A 


1995400 A 


t3g10690 








2 
2 


.23 
.20 


ri 


ceAAD29710.1 


histone H2B-like protein A 


999101 A 


t5g59910 








2 


16 






putative mismatch A 
binding protein 


993280 A 


t3g24320 








2 


10 


ri 

C( 


;eCAD41 187.1, 
)mAAF35250.1 


adenosylhomocysteinas A 
e 


996953 A 
_ 


t4g13940 








2 


07 


C( 


)m AAL33588.1 
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Cell Cycle (2) 




















E2Fa ^ 


J294534 A 


t2g36010 








9 


4.88 






CDKB1;1 C 


)10851 A 


t3g54180 








2 


60 r 

C 


rTCCCG 




Ceil wail biogenesis 


















1 


(11) . -- 
xyiogiucan enao- 1,**- r 

beta-D-glucanase (meii- 


^1994459 A 


t4g30270 








3 


,74 






5) 

putative glycosyl ^ 
ransferase 


U999244 


it1g70090 








3 


.38 




1 


aipna ^ 
galactosyltransferase- 

lilfP nrntfiin 


11998223 f 


^t3g52720 








3 


.26 






INC (Jl Wlwil 1 

putative xyioglucan / 
endotransglvcosylase 


\1999683 / 


y3g23730 










!.85 


r 
( 


ice CAD41426.1, 
;om CAB51 0059.1 


Xyiogiucdn tsi luu- 1 jt / 
beta-D-glucanase-lilte 

nrntpin 


M998301 / 


U4g30280 










>.74 






putative xyioglucan 

porintrfliTialvcQSvlase 


A1994477 


\t1 914720 








. 


2.51 




j 


putative glycosyl 


AI999770 


At1g24170 










2.39 




j 


putative UDP-glucose 
fill If nsvltransferase 


AI997288 


At1g22400 










2.34 


rrrcccG 
c 




putative 

nil irn^vltransferase 


AI998872 


At2g15480 










2.15 






peroxidase 


AI994622 


At2g38380 










2.11 


nCGCG 

c 




beta-1 ,3-glucanase-like 

rvrnfpin 


AI994681 


At3g55430 










2.05 




rice AAB37697.1, 
corn CAB96424.1 


piuicii 1 

Chloropiastic genes 




















(7) 

large subunit of 
ribulose-1,5- 
bisphosphate 
carboxylase/oxygenase 


N96785 


rbcL 






2713 
2715 


2714 
2716 


4.71 
0.54 




NP_051067 

MP 051080 

>il \J\J \ 1 


ribosomai protein L33 
PSil 1 protein 


A1994194 
AW004203 


rpl33 
psbl 






2717 


2718 


Z.ol 




MP 051074 


ribosomai protein L2 


AW004266 


rpl2 






2719 


2720 


2.61 




NP_051099 


ATP-dependent 
protease subunit 


AI997947 


clpP 






2721 


2722 


2.60 




NP_051083 


cytochrome B6 


AI997102 


pets 






2723 


2724 


2.55 






ATPase epsilon subunit 


AW004251 


atpE 






2725 


2726 


2.17 




NP_051065 


A/Iitochondriai genes 




















Lli — 

26S ribosomai RNA 

protein 


AW004275 


orf107a 






2727 


2728 


2.87 




NP_085475 


Transcription factors 
(6) 










1109 


1110 


4.01 




riceBAB92193.1 


LOB domain proUen 41 
■ WRKY transcription 
factor 21 


AI996685 
A1992739 


At3g02550 
At2g30590 


3 


55 


2729 


2730 


9 7R 
2.75 


TTTCCCC 

c 


~ maize AY0721 49 


GATA Zn-finger protein 
Anttiocyaninless2 


Ai995731 
AI993655 


Al3g 16870 
At4g00730 


6 


58 






2.43 


TTTCCCC 
C 




[leucine zipper- 


AI995691 


Al1g07000 


1 













containing protein 




















homeodomain 
transcription factor 
(Atlib-6) 


AI999190 


At2a22430 














com CAB96424.1 


Metabolism and 
biogenesis (11) 




















alcohol dehydrogenase 


AI998773 


At1g77120 










5.09 






putative isocistrate 
lyase 


AI9ggi68 


At3g21720 










3.08 






protoclilorophyllide 
reductase precursor 


AI993342 


At4a27440 
















suger transpoter i\ke 
protein 


AI997793 


Af4a?Bfi70 










9 97 




nee MMrv lo it/, i, 
comAAF74568.1 


NADH-dependent 
glutamate syntliase 
(GOGAT) 


AI997600 


At5a53460 










9 9*1 






nitrate reductase (NIA2) 


AI996208 


At1g37130 










2.15 






oectate Ivase - like 
protein 


AJ508995 












9 1*? 






putative sterol 
detiydrogenase 


AI996340 


At2g43420 










2.10 






giutamine synthetase 
root isozyme 1 (GS) 


161G19T7 


At1g66200 










2.06 






monosaccharide 
transporter STP3 


AI997045 


At5g61520 










2.05 




riceBAA83554.1, 

mm AAP74'if5ft 1 


Signal transduction (6) 




















calcium-dependent 
protein kinase 


AI996555 


kt5g66210 










2.96 




riceAAF23901.2, 
com BAA12715.1 


WD-40 repeat protein 


AI993055 


At5o14530 














nee t\Ui.iooi . i, 
com AAA50446.1 


receotor-crotien kinase- 
lilce protein 


AI994727 












9 liQ 




nee MAi\Doyo4.i , 
com AAB09771.1 


putative phytochrome A 


AI998146 


At1g09570 










2.45 






Dutative leucinp-rich 
receptor-lil(e protein 
kinase 














9 "11 




rice DAUUoiiUo.i, 
comCAC35411.1 


putative receptor-like 
kinase 


Aig93298 


At3g23750 










2.06 




riceCAA69028.1, 
com CAC35412.1 


Others (13) 




















putative pollen allergen 


AI996548 


«3g45970 










3.22 




riceAAG13596.1, 
com CAD4084t) 1 


cold-regulated protein 

COR6,6 


AW004198 


^t5g15970 










3.03 






phi-1 -like protein 


M994601 


flit5g64260 










2.60 






lipid-transfer protein-like 


'\I998609 / 


'\t5g01870 










2.33 




riceBAB86497.1, 


DnaJ homologue 


M994551 / 


^t5gO6910 










2.32 


ATTGGCG 
C 




blue copper binding / 
protein 


^1996535 / 


M5g20230 










2.30 






src-2 like protein t 


M998679 / 


\t1g09070 


1 ( 


33 


W1 


102 


2.19 






RING finger protein / 


i\l999491 / 


\t3g61460 










2.14 




iceBAA85438.1, 
;om AAL59234.1 


putative Ticc22 / 


\i993361 / 


\t3g23710 










2.14 






nodulin-like protein / 


M996322 / 


^t1g80530 










2.07 




iceAAM01022.1 


putative resistance / 
protein 


M997549 / 


\t1g61100 










2.06 




iceAAL83695.1, 
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i 

AI993446 


Al5g20250 










9 OS 




rice AAL01 163.1. 


putative disease 
resistance protein 


AI998978 


M1g72900 










2.04 




cornAAC83564.1 
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ubiquitin-conjugating 
enzyme E2-17kD 8 


AI994771 


At5g41700 










2.40 


PCT; 


'EP20()3/011658 


vegetative storage 
protein Vsp2 


AI999152 


At5g24770 










2.35 






heat shock orotein 70 














2.24 






chioropiast outer 
enveloDe membranp 
protein 


AI997015 


At3g63160 










2.20 






translation initiation 
factor-like protein 


AI992786 


At5a54940 
















pseudogene 


AI995323 


At2a04110 










£..\Jl 






vegetative storage 
protein Vsp1 

r 


Aig99546 


At5g24780 










2.06 






dehydrin ERD10 


AI997518 


At1g20450 










2.06 






\^TN3-like orotein 














2.05 






putative chlorophyll A-B 
bindina orotpin 


AI994859 


At3g27690 










2.05 






ohotosvstem 1 neartinn 
centre subunitpsaN 




Al0gD4U4U 










2.03 






AR781, similar to yeast 
pheromone receptor 


AI998194 


At2a26530 










£..\)0 






putative lipid transfer 
protein 


M997024 


At2g15050 










2.03 






peroxidase ATP3a / 


M998372 


'^t5g64100 










2.03 






myosin heavy chain-like / 
protein 


M99g224 


M3g16000 










2.01 







* this sequence is present in the MIPs database version of 25 july 2002 
** this record has an updated MIPS accession number At5g50101. 



Table 3. Number of E2F elements In the different datasets 

5 





All genes (4518) 


Upregulated qenes (88) 


Dovynregulated genes (105) 


TTTCCCCC 


52 


2 


3 


TTTCCCGC 


40 


D 


D 


TTTCGCCC 


15 


0 


[) 


riTCGCCC 


13 
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0 


1 TTGCCCC 


37 


1 


1 


TTTGCCGC 


20 


0 


1 


TTTGGCCC 


55 


D 


2 


rTTGGCGC 


15 


1 


0 


ATTCCCCC 


10 


Q 


2 


ATTCCCGC 


6 


0 


D 


ATTCGCCC 


8 


0 


0 


ATTCGCCC 


14 


0 


0 


ATTGCCCC 


13 


0 


0 


ATTGCCGC 


10 


1 


0 


ATTGGCCC 


44 


0 


2 


ATTGGCGC 


9 


1 




Total 


371 


13 


11 1 



Table 4: Arabldopsis genes 1.3 fold or more upregulated In E2Fa/Dpa plants 



SEQ ID 


NO 


Gene name 


e_value 


lAiPS accession number 


ratio 


cDNA 


PROT 











83 



wo 2004/035798 



PCT/EP2003/On658 
At5g51100 |1.42 




129 


WO 2(1 
130 


104/035798 

putative protein l<inase 


0 


PCT/EP2003/01 1 
Atl 008720 


1 .00 


131 


132 


liypothetical protein 


0 


At3a 12200 




133 


134 


putative phosphatidylserine decarboxylase 


0 


At4a25970 


I.JO 


135 


136 


unknown protein 


0 


At2a03120 


1 '31 


137 


138 


unl(now/n protein 


0 




1 AQ 


139 


140 


histone H2A.F/2 


0 




1.00 


141 


142 


4.coumarate-CoA ligase - Wke 


0 


At4a190in 


1.00 


143 


144 


putative protein 


0 




1 70 


145 


146 


unicnown protein 


0 


At3a1954n 


1.04 


147 


148 


putative protein 


0 


At4a3441 f) 


I.OD 


149 


150 


unl<nown protein 


0 


At1a61?fin 


1 07 


151 


152 


putative protein 


0 


Af'^nfi14Qn 


l.oZ 


153 


154 


lipoxygenase 


0 


At1n 17490 


1.o4 


155 


156 


putative SecA-type cliloropiast protein transport factor 


0 


Af4nni fion 


1.00 


157 


158 


putative DNA-binding protein 


0 




i.4y 


159 


160 


hypothetical protein 


0 




1.0( 


161 


162 


hypothetical protein 


2E-90 


r\i 1 yr / OOKJ 


i.oy 


163 


164 


unknown protein 


0 




1.04 


165 


166 


bZIP transcription factor-like protein 






1.0Z 


167 


168 


putative protein 


1E-154 


AH 0*^6790 


I.oy 


169 


170 


hypothetical protein 


0 


rw. I y / uouu 


1.0Z 


171 


172 


6-phosphogiuconate dehydrogenase 


2E-80 


At5g41670 


1.4o 


173 


174 


ferritin 1 precursor 


0 


At5g01600 


1.00 


175 


176 


putative ABC transporter 


0 


At1g71330 


1.71 


Ml 


178 


hypothetical protein 


0 


At1g27300 


0 

1.0 


179 


180 


myrosinase precursor 

— 


9E-01 


At5g26000 


ii.ol 


181 


182 


unknown protein 


OE+00 At1g10270 


\A1 


183 


184 


putative protein 


3E-88 


At5g1865G 


.00 


185 


186 


hypothetical protein 


6E-40 


At2g36090 


■1 00 


187 


188 


unknown protein 


D 


At1g43910 


I /<0 

1 .4/: 


189 


190 


hypothetical protein 


0 


At1g07000 


0 /10 

^.4o 


191 


192 


hypothetical protein 


Q 


At1g18260 


1.40 


193 


194 


putative pre-mRNA splicing factor 


3 


At4g03430 


1.49 


195 


196 


jutative protein 


3 


At5g11810 


00 
.0<C 


197 


198 


lypothetical protein 


IE-151 


At4g30150 


1.41 


199 . 


200 . 


s-receptor kinase -like protein ( 


) 


At4g32300 


CO 

.oz 


201 : 


202 c 


Jisease resistance RPP5 like protein 


E-175 


At4g16950 


0/1 

.54 


203 : 


m I 


inknown protein ; 


E-58 


At1g76520 


AA 
.44 


205 : 


?06 F 


)utative protein i 


E-144 


At5g14420 ^ 


i.Ob 


207 : 


!08 F 


utafive glucosyltransferase i 


iE-78 


At1g23480 1 


.31 


209 : 


i10 F 


utative protein i 


E-144 


At4g28470 1 


.34 


211 2 


!12 p 


utative protein r 




^t4g29830 1 


.00 


213 5 


14 p 


utative auxin-regulated protein q 




^t2g33830 1 


.41 


215 2 


16 p 


utative protein □ 


E+00 / 


i^tSgBISSO 1 


OO 

.38 


217 2 


18 u 


nknown protein n 


J 


^t1g44810 1 


.39 


219 2 


20 p 


rotein phosphatase - like protein ■} 


E-59 / 


\t5g02760 1 


.76 


221 2 


22 h 


ypothetical protein 2 


E-21 / 


\t4g17800 1 


.oy 


223 2 


24 h 


ypothetical protein 0 


/ 


\t1g54080 1 


.58 


225 2 


26 XI 


/loglucan endo-transglycosylase, putative o 


/ 


\t1g14720 2 


.51 


227 2 


28 p 


utative protein 0 


f 


\t3g49320 1 


7 


229 2 


30 b 


3ta-1 ,3-glucanase - like protein q 




^t3g55430 2 


05 


231 2 


32 pi 


Jtative protein 0 




U3g45730 5 


14 
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wo 2004/035798 



PCT/EP2003/011658 



234 



ubiquitin-conjugating enzyme E2-21 kD 1 (ubiquitin-protein 
igase) 



At5g41340 



1.32 




86 



333 


334 


unknown protein 


0 


At3g28920 


1.93 


335 


336 


hypothetical protein 


0 


At3g03750 


1.43 


337 


338 


hypothetical protein 


2E+00 


At4g27610 


1.34 


339 


340 


transia; :;n initiation factor eiF-2 beta chain - lilce protein 


2E+00 


At5g20920 


1.33 


341 


342 


unl<nown protein 


0 


At2g26280 


1.53 


343 


344 


unknown protein 


0 


At1g78420 


1.39 


345 


346 


elongation factor, putative 


3E+00 


At1g56070 


1.99 


347 


348 


anthranilate N-benzoyltransferase - lil<e protein 


1E-120 


At5g01210 


1.66 


349 


350 


putative protein 


1E-178 


At4g39680 


1.43 


351 


352 


unknown protein 


0 


At3g05380 


1.92 


353 


354 


splicing factor At-SRp40 


0 


At4g25500 


1.52 


355 


356 


cdc2-iike protein kinase 


0 


At5g10270 


1.77 


35/ 


358 


calcium-dependent protein kinase 


1E-169 


At3g57530 


1.39 


359 


360 


phosphoprotein phosphatase, type 1 catalytic subunit 


0 


At2g29400 


1.48 


361 


362 


putative mitochondrial translation elongation factor G 


0 


At2g45030 


1.65 


363 


364 


long-chaln-fatty-acid-CoA ligase-like protein 


0 


At5g27600 


1.34 


365 


366 


cytochrome c, putative 


4E-26 


At3g27240 


1.36 


367 


368 


En/Spm-like transposon protein 


0 


At2g40070 


1.41 


369 


370 


putative phospho-ser/thr phosphatase 


0 


At4g03080 


1.41 


371 


372 


chloroplast 50S ribosomal protein L22, putative 


6E-77 


At1g52370 


1.4 


373 


374 


unknown protein 


0 


At2g15890 


1.34 


37- 


376 


putative protein 




At4g26750 


1.55 


■Hi 


378 


receptor-protein kinase-like protein 


0 


At5g54380 


2.59 


379 


380 


phosphoglycerate kinase, putative 


1E-155 


AtSa 12780 


1 88 


381 


382 


putat[ve HMG protein 




At2a17560 




383 


384 


hypothetical protein 


0 


AtlQ76100 


1 36 


385 


386 


protein kinase cdc2 homolog B 


0 


At3g54180 


2.39 


387 


388 


T-complex protein 1, beta subunit 


0 


At5g20890 


1.39 


389 


390 


proline oxidase, mitochondrial precursor (osmotic stress-induced) 


0 


At3g30775 


1.45 


391 


392 


inker histone protein, putative 


1E-126 


Atlg 14900 


1.33 


393 


394 


hypothetical protein 


0 


At1g27500 


2.21 


395 


396 


ARFI-binding protein 




At5g62010 


1.58 


397 


398 


putative protein 


0 


At5g16270 


1.37 


399 


400 


putative protein 


1E-173 


At5g 13850 


1.32 


401 


402 


src-2 like protein 


D 


At1g09070 


2.19 


403 


404 


RAN2 small Ras-like GTP-binding nuclear protein (Ran-2) 


D 


At5g20020 


1.3 


405 


406 


phosphoprotein phosphatase (PPX-1) 


D 


At4g26720 


1.42 


407 


408 


nuclear protein-like 


3 


At5g64270 


1.45 


409 


410 


Dmithine carbamoyltransferase precursor 




At1g75330 


1.41 


411 


412 


jnknown protein 


3 


At2g41650 


1.67 


413 


*14 


Dutative protein 




At5g17640 


1.66 


415 - 


416 


lypothetical protein 




At1g57990 


^02 


417 ' 


418 


lypothetical protein { 


) 


At4a15760 


.64 


419 i 


420 ( 


jlycine-rich protein 2 (GRP2) ( 


) 


At4a38680 


7? 


421 / 


422 


lypothetical protein 


E-113 


At2a41780 ; 

r»lt.yT^ 1 1 \J\J 


>6 


423 ' 


424 [ 


RfMA-binding protein, putative { 


E-95 


At3g20250 


.46 


425 I 


426 c 


jda-l, putative j 


E+00 


At3g27090 


.46 


427 ^ 


128 t 


)eta-fructofuranosidase 1 C 


) 


At3g13790 


.32 


429 I 


130 : 


?6S proteasome subunit 4-like protein C 


) 


At4g29040 1 


.51 


431 I 


132 F 


)utative protein 


E-59 


At1g33980 


.42 


433 i 


m \ 


lypothetical protein C 


) 


At1g57680 


!.66 


435 I 


136 I 


mknown protein C 


1 


At1g29980 


.98 



87 
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1437 



439 



443 



445 



447 



449 



451 



453 



455 



457 



459 



461 



463 



465 



467 



469 



471 



473 



475 



477 



479 



481 



483 



438 



440 



441 442 



444 



446 



SOS ribosomal protein - like 



RAC-like GTP-binding protein ARAC4 



unknown protein 



unknown protein 



unknown protein 



450 



putative DNA gyrase subunitA 



452 



putative protein 



454 



eukaryotic cap-binding protein (gb|AAC17220.1) 



456 



pliospiioethanolamine N-metiiyltransferase. putative 



458 



unknown protein 



460 



RANI smalt Ras-like GTP-binding nuclear protein (Ran-1) 



462 



464 



466 



468 



470 



472 



474 



putative protein 



putative auxin-regulated protein 



adenylosuccinate synthetase 



putative protein 



putative ubiquitin activating enzyme El (ECR1) 



putative mitodiondrial processing peptidase 



putative protein 



476 



hypothetical protein 



478 



hypothetical protein 



480 



putative pyrophosphate-fructose-6-phosphate 1- 
phosphotransf erase 



482 



putative receptor-like protein kinase 



484 



putative protein 



iVOLUTA or interfascicular fiberlessl 



1E-180 



2E-42 



0 



At5g02870 



At5g6Q690 



At1g20090 



At3g07390 



IE-154 



1E-153 



0 



lE-104 



At5g65660 



At3g05040 



At3g 10690 



At3g49170 



At5g18110 



At1 973600 



At2g30590 



At5g20010 



At4g24290 



At2g45210 



At3g57610 



At5g14530 



At5g19180 



E-104 



At3g02090 
At3g48530 



At1g20330 



At4g02590 



At1g12000 



At2g02220 



At4g02440 



1.39 



1.34 



1.78 



1.34 



1.7 



1.52 



2.2 



1.53 



1.41 



1.62 



2.78 



1.32 



1.39 

2.7 



.46 



.33 



1.63 



1.4 



1.55 



1.47 



1.36 



1.49 



1.55 



1.4 



485 486 



489 



491 



493 



495 



497 



499 



501 



503 



505 



507 



509 



511 



513 



515 



517 



519 



521 



523 



525 



527 



529 



531 



533 



535 



non-phototropic hypocotyl, putative 



At1g30440 



487 488 



490 



492 



494 



496 



498 



500 



502 



504 



506 



histone deacetylase 



At5g63110 



putative protein 



multicatalytic endopeptidase complex, proteasome precursor, 
beta 



fibrillarin - like protein 



6E-77 



hypothetical protein 



8E-45 



histone HI, partial 



hypothetical protein 



enoyl-CoA hydratase - like protein 



unknown protein 



7E+00 



myb-related transcription factor, putaBve 



508 



Alcohol Dehydrogenase 



510 



hypothetical protein 



E-132 



512 



unknown protein 



514 



putative receptor-like protein kinase 



At5g66580 



At4g31300 



At4g25630 



At1g54060 



At2g30620 



At3g09030 



At4g31810 



At2g27080 



At3g23250 



At1g77120 



At1g27590 



At1g14710 



516 



putative protein 



0 



518 



homeobox protein knotted-1 like 4 (KNAT4) 



1E-165 



520 



putative protein 



1E-142 



522 



carbonyl reductase-like protein 



7E+00 



524 



hypothetical protein 



E-50 



526 



aspartate-tRNA ligase - like protein 



528 



530 



532 



unknown protein 



1E-121 



amino acid transporter protein-like 



12-oxophytodienoate reductase (0PR1) 



534 



calnexin homolog 



6E-25 



At2g 13790 



At5g14550 



At5g11060 



AtSg 15540 



At5g51030 



At1g53900 



At4931180 



At3g06150 



At5g65990 



536 



unknown protein 



0 



At1g76680 
At5g07340 



At1g61100 



1.57 



1.36 



3.18 



1.42 



1.3 



1.36 



.58 



1.45 



1.31 



1.51 



1.49 
5.09 



1.38 



1.36 



1.68 



1.39 



1.4 



1.47 



2.17 



.36 



1.62 



1.74 



1.59 



1.43 



1.39 



2.06 
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641 j642 [ hypot hetical protein 
643 



644 



putative protein 



645 645 



SNF-2like RING finger 



647 648 



unknown protein 



649 650 



putative glucosyl transferase 



PCT/EP2003/011658 

At1g56290 | -33 



Al3g62630 



At1g61140 



At3g16310_ 
At2g35800 



1.49 



1.36 



.38 



.42 



B51 652 



putative protein 



At4g25170 



653 654 



hypothetical protein 



9E-39 



At4g00450 



655 . 655 



glutathione S-transferase 



At2g30860 



557 658 



unknown protein, 3' partial^ 



At3g15095 



659 660 



unknown protein 



661 W~ TCH4 protein (9blAAA92363.1) 



663 664 



putative protein 



665 666 



667 668 



669 670 



671 672 



673 674 



At3g21080 



receptor-like kinase, putative 



putative 2.3-bisphosphoglycerate-independent phosphoglycerate 



putative protein 



DnaJ homolc9ue(gblAAB91418.1l) 



hypothetical protein 



At5g57560 



At3g61600 



At3g23750 



At1g09780 



At5g14250 



At5g06910 



At1g33250 



1.36 



1.49 



1.42 



.31 



1.34 



2.06 



134 



1.51 



2.32 



1.35 



675 



677 



681 682 



689 690 



691 692 



693 694 



703 704 



705 706 



707 708 



709 710 



711 712 



713 714 



715 716 



717 718 



719 720 



676 
1678" 



unknown protein 



aspartate carfaamoyltransferase precursor (aspartate 



679 680 



hypothetical protein 



unknown protein 



683 684 



685 686 



687 688 



At2g19800 



3E-84 



At3g20330 



5E+00 



putative protein 



hypothetical protein 



RNAhelicase, DRH1 



putative 3-oxoacyl [acyl-carrier protein] reductase 



putative cellular apoptosis susceptibility protein 



hypothetical protein 



695 696 



starch branching enzyme II 



697 698 



40S ribosomal protein -like 



699 .700 



putative protein 



701 702 



putative protein 



2E-61 



hypothetical protein 



AP2 domain containing protein f^P2.3 



putative protein 



NADH-dependent glutamate synthase 



a rginine/serine rich splicing factor RSP3 
hypothetical protein 



translation initiation factor e!F3 - like protein 



putative serine/threonine protein phosphatase catalytic subunit, 



unknown protein 



721 722 



COP 1 -interacting protein CIP8 



723 724 



nonphototropic hypocotyl 1 



725 726 



putative protein 



727 728 



putative protein 



729 730 



731 732 



733 734 



735 736 



737 738 



739 740 



1 



'41 742 



743 744 



E+00 



E-55 



E-179 



E-142 



At1g16520 



Atl 948620 



At4g35750 



At3g13620 



At3g01540 



At1g24360 



At2g46520 



At1g01540 



At2g36390 



At5g15200 



At4g13640 



At3g45970 



2E-9 



1E-47 



4E-59 



At1g66160 



At3g16770 



At5g02880 



At5g53460 



At3g51860 



1E-134 



At1g55880 



6E-77 



At4g20980 



1E-105 
0 



6E+00 



1E-78 



putative protein 



1E-127 



putative protein 



cytochrome P450 monooxygenase 



thylene responsive element binding factory 



hypothetical protein 



unknown protein 



putative protein kinase 



DNA repair protein RAD23 homotog 



E-104 



1E-123 

0 

0 



At2g42500 



At1g33480 



At5g64920 



At3g45780 



At5g10860 



At5g19750 



At3g52500 



At4g10280 



At4g31500 



At4g17500 



At1g17620 



At3g07390 



At3g02880 



At5g38470 



90 



1.49 



1.35 



1.33 
1.39 



1.79 



.56 



.42 



1.43 



1.31 



1.36 



1.32 



1.33 



3.22 



1.31 



1.51 



1.32 



2.25 



1.31 



.37 



1.45 



1.38 
1.91 



1.4 



1.47 



1.32 



1.37 



1.39 



1.76 



1.35 



.33 



1.37 
'1.42 



1.46 



1.42 
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745 


746 


GTP-binding protein - like 


1E-25 


At5g03520 


1.57 


747 


748 


putative protein 


0 


At3g63500 


1.4 


749 


750 


putative adenylate i<inase 


4E+00 


At2g39270 


1.37 


751 


752 


protein l<inase - Ike 


6E-46 


At5g59010 


1.42 


753 


754 


uni<nown protein 


0 


At3g04630 


1.58 


755 


756 


RNA binding protein 


0 


At1g73490 


1.32 


757 


758 


putative phosphollpase D 


0 


At3g15730 


1.51 


759 


760 


importin alpha 


1E-115 


At3g06720 


1.45 


761 


762 


RING-H2 finger protein RHF2a 


0 


At5g22000 


1.43 


763 


764 


putative protein 


2E-93 


At4g19160 


1.3 


765 


766 


putative protein 


0 


At4g32440 


1.41 


767 


768 


putative protein phosphatase type 2C 


0 


At3g15260 


1.61 


769 


7/0 


putative protein 


0 


At5g39890 


1.31 


771 


772 


ribosomal protein 


0 


At4g16720 


1.42 


773 


774 


dormancy-associated protein 


9E+00 


At1g28330 


2.01 


775 


776 


auxin-lnducible gene (iAA2) 


0 


At3g23030 


1.65 


777 


778 


unknown protein 


5E+00 


At1g76010 


1.54 


779 


780 


protein kinase ADKI-like protein 


1E+00 


At4g28540 


1.96 


781 


782 


putative protein 


0 


At4g24210 


1.36 


783 


784 


hypothetical protein 


0 


At1g79530 


1.4 


785 


786 


putative trehalose-6-phosphate synthase 


0 


At1g68020 


1.45 


787 


788 


adenylate kinase 


0 


At5g63400 


1.39 


789 


790 


putative proline-rich protein precursor 


0 


At1g73840 


1.56 


791 


792 


putative protein 


5E-87 


At5g14370 


1.37 


793 


794 


hypothetical protein 


0 


At4g33290 


1.7 


795 


796 


cytochrome P450 monooxygenase (CYP71B3) 


0 


At3g26220 


1.32 


797 


798 


TMV resistance protein N - like 


0 


Af4g 19530 


1.5 


799 


800 


hypothetical protein 


9E-70 


At1g54830 


1.33 


801 


802 


3-ketoacyl-CoA thiolase 


0 


At2g33150 ' 


1.87 


803 


804 


putative protein 


0 


At3g54350 


1.35 


805 


806 


hypothetical protein 


1E-170 


At4g02680 


1.36 


807 


808 


putative bHLH transcription factor 


0 


At2g46510 


1.35 


809 


810 


RNA-binding protein, putative 


5E-84 


At3g26420 


1.55 


811 


812 


putative lectin 


3E-20 


At3g09190 


1.67 


813 


814 


xylogiucan endotransglycosylase, putative 


0 


At3g23730 


2.85 


815 


816 


unknown protein 


2E-33 


At2g41170 


1.32 


817 


818 


putative protein 


3E-78 


At3g57150 


1.67 


819 


820 


putative glucose regulated repressor protein 


0 


At2g25490 


1.81 


821 


822 


putative AP2 domain containing protein RAP2.4 gi|2281633 


1E-150 


At1g78080 


1.82 


823 


824 


putative sulfate transporter 


0 


At1g8031O 


1.51 


825 


826 


G protein alpha subunit 1 (GPA1) 


0 


At2g26300 


1.44 


827 


828 


protochlorophyllide reductase precursor 


0 


At4g27440 


2.39 


829 


830 


Shaggy related protein kinase tetha 


0 


At4g00720 


1.52 


831 


832 


Qutative protein kinase 


D 


At3g01300 


1.49 


833 


334 


RNA-binding protein-like protein 


D 


At3g47160 


1.31 


835 


336 


jnknown protein 


1E-150 


At5g24670 


1.47 


837 J 


338 < 


anc finger protein ZFP8 


1E-144 


At2g41940 


1.42 


839 1 


340 


3TP binding protein beta subunit 


3 


At4g34460 


1.54 


841 ( 


342 ( 


X)pia-like retroelement pol polyprotein 


3 


At2g22680 


1.4 


843 f 


344 


30NSTANS-like B-box zinc finger protein-like 


D 


At5g57660 


1.36 


845 i 


346 


jnknown protein 


3E-71 


At3g10640 


1.33 


847 { 


348 


mutative protein 


D 


At4g24690 


1.91 
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953 


954 


histone HI 


0 


At2g18050 


1.75 


955 


956 


unknown protein 


0 


At1g08630 


1.45 


957 


958 


unknown protein, 5'partial 


0 


At3g18035 


3.31 


959 


960 


unknown protein 


0 


At1g04140 


1.37 


961 


962 


HAL3A protein 


0 


At3g18030 


1.43 


963 


964 


plii-1-like protein 


0 


At5g64260 


3.38 


965 


966 


tiypothetical protein 


0 


At1g12770 


1.35 


967 


968 


pollen specific protein SF21 


0 


At5g56750 


1.45 


969 


970 


cysteine proteinase inhibitor like protein 


1E-159 


At4g16500 


1.33 


971 


972 


20S proteasome subunit C8 (PAG1/PRC8 ARATH) 


1E-130 


At2g27020 


1.36 


973 


974 


nodulin-like protein 


1E-99 


At1g75500 


1.34 


975 


976 


hypottietical protein 


0 


At1g72900 


2.04 


977 


978 


iiypottietical protein 


0 


At2g35230 


1.42 


979 


980 


arm repeat containing protein tiomolog 


0 


At3g46510 


1.4 


981 


982 


putative protein 


0 


At5g67480 


1.76 


983 


984 


putative leucyl-tRNA synthetase 


1E-118 


At1g0g620 


1.52 


985 


986 


Putative UDP-glucose glucosyltransferase 


1E-164 


At1g22400 


2.34 


987 


988 


alanine aminotransferase, putative 


0 


At1g17290 


1.66 


989 


990 


26S proteasome AAA-ATPase subunit RPTSa 


0 


At5gig990 


1.36 


991 


992 


Ruv DNA-helicase-like protein 


0 


At5g22330 


1.59 


993 


994 


small nuclear ribonucleoprotein, putative 


0 


At1g65700 


1.33 


995 


996 


unknown protein 


0 


At2g38310 


2.79 


997 


998 


protein phosphatase type 1 PP1BG 


3E-91 


At4g11240 


1.51 


999 


1000 


hypothetical protein 


3E41 


At2g43410 


2,1 


1001 


1002 


putative protein 


0 


At5g58600 


1.42 


1003 


1004 


nodulin-like protein 


0 


At1g80530 


2.07 


1005 


1006 


}utative protein 


0 


At5g56170 


1.65 


1007 


1008 


dihydroxyacetone kinase, putative 


1E-167 


At3g17770 


1.67 


1009 


1010 


r bsomal protein - like 


1E-155 


At5g09770 


1.44 


1011 


1012 


101 kDa heat shock protein; HSPIOMike protein 


0 


At5g57710 


1.34 


1013 


1014 


unknown protein 


0 


At5g51340 


1.48 


1015 


1016 


unknown protein 


0 


At3gD5730 


1.46 


1017 


1018 


DUtative protein 


2E+00 


At5g67570 


2.6 


1019 


1020 


mitochondrial chaperonin (HSP60) 


0 


At2g33210 


1.75 


1021 


1022 


putative protein 


1E-177 


At3g63270 


1.34 


1023 


1024 


growth factor like protein 


0 


At4g 12720 


1.78 


1025 


1026 


RNA hellcase, putafive 


D 


At3g19760 


1.54 


1027 


1028 


aseudogene 


E-142 


At2g34760 


1.81 


1029 


1030 


lypothetical protein 


3 


At3g21740 


1.52 


1031 


032 


shaggy-like kinase beta 


3 


At3g61160 


1.36 


1033 


1034 


jnknown protein 


E-165 


At1g20100 


1.35 


1035 


036 


24-sterol C-methyltransferase 


E-143 


At5g 13710 


1.41 


1037 


038 \ 


i/VD-40 repeat protein (MSI3) 




At4g35050 


1.89 


1039 


040 


lypothetical protein 


3 


At1g67120 


1.51 


1041 


042 


jutative protein (fragment) ( 


3 


At5g 14930 


1.46 


1043 


044 


)utative protein 


E-6 


At5g54180 


1.78 


1045 


1046 


lypothetical protein 


E-126 


Atl 920570 


.43 


1047 


048 c 


;alcium-dependent protein kinase ( 


3 


At5g66210 : 


1% 


1049 


050 r 


litrilase 2 


E-127 


At3g44300 


.88 


1051 


052 r 


nethionyl-tRNA synthetase - like protein 


E-173 


At4g13780 


.33 


1053 ■ 


054 


)utative protein 


) 


At4g24230 


.58 


1055 • 


1056 


)utative protein ^ 


lE-76 


At5g 19330 


.33 
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1161 


d CO 


jnknown Drotein 1E-102 


At1a07360 

fW, 1 W w f www 


1.74 


IIdo 




ysyl-tRNA synthetase 


lE-96 


At3a11710 

f^lWU III 1 w 


1.38 


A A f^C * 

1165 


166 


jnknown protein < 




At^noyyan 


1.51 


1167 


168 


ryptophan synthase beta chain 1 precursor (sp|P14671) 


1E-102 


rMoyoto \ \} 


1 55 


1169 


170 


Dutative protein ' 


3E-98 




1 81 

I ,\J 1 


1171 


172 


RuvB DNA helicase-like protein 


} 


rMOyDl DOU 


1 'V) 


1173 


174 


putative pectin methylesterase 


3 


Mlog I'to lu 


1 4"^ 


1175 . 


176 


putative cytldine deaminase 


3 


Mizg loOlU 


1 41 


1177 


178 


hypothetical protein 


D 


Mug 1 uu 


1 4? 


1179 


1180 


1-aminocyclopropane-1-carboxylate synthase -like protein 


D 




1.54 


1181 


182 


peroxidase 


3E-88 




2.11 


1183 


1184 


2-oxogiutarate dehydrogenase, El cx3mponent 


D 


rVioguo/ uu 


1.44 


1185 


1l86 


xylosidase 


D 


r\iog*ryouu 


1.93 


1187 


1188 


ethylene responsive element binding factor 4 


D 


Mlog lu^ lu 


1.7 


1189 


1190 


putative protein 


2E+00 




3 54 


1191 


1192 


eukaryotic protein synthesis Initiation factor 4A 


0 


A*^n1 '^Q'jn 
MlOg 1 CyoiSJ 


1 35 


1193 


1194 


Unknovi/n protein 


0 




2 34 


1195 


1196 


hypothetical protein 


0 


AM 1 g 1 900U 


1 54 


1197 


1198 


unknown protein 


0 


AfRn^QR4r) 


1.78 


1199 


1200 


putative xyloglucan-specific glucanase 


0 


AtOnm fii^n 

MliLyU 1 03U 


1.58 


1201 


1202 


similar to nucellln gb|AAB96882.1 


1E-106 


MI 1 y'toUOU 


1 5 


1203 


1204 


unknown protein 


0 


Miogi£909U 


1 33 


1205 


1206 


putative protein 


0 


AUgoZlgU 




1207 


1208 


putative maiate dehydrogenase 


0 


Ml 1 gU44 1 U 




1209 


1210 


putative isocitrate lyase 


1E-153 


Alogzi t zu 


o.uo 


1211 


1212 


DNA-binding protein 


1E-160 


Atogi4Zou 


1 .*rO 


1213 


1214 


histone H4-like protein 


0 


Aiog4Do/:U 




1215 


1216 


putative dehydrogenase 


0 


Atig/n/u 


1 47 


1217 


1218 


aianine-tRNA ligase, putative 


0 


Afi fiRnonn 
Ai 1 gouzuu 


1 • 


1219 


1220 


oligopeptidase A - like protein 


1E-172 


Mioy 1 uo*?u 


1 43 


1221 


1222 


putative protein 


0 


AtCnRQfiOO 
AlOyDZOZu 


1 32 


1223 


1224 


permease 


0 


AfCnAQQQn 

AlOyHyyyu 


1 .o 


1225 


1226 


DEAD BOX RNA helicase RH1 5 


1E-129 


Aiog 1 1 zuu 


1 4 


1227 


1228 


lipoamide dehydrogenase precursor 


1E-12B 


AWg 1 / Z'HJ 


1 


1229 


1230 


hypothetical protein 


0 


Ai 1 g 1 0 1 / u 


1 y*5 


1231 


1232 


xyloglucan endo-1,4-beta-D-glucanase (XTR-6) 


0 


Awg/coo 1 u 




1233 


1234 


histone H2B like protein (emb|CAA69025.1) 


7E-34 


Aiogi^zoou 




1235 


1236 


S-receptor kinase homotog 2 precursor 


1E+00 


AfinRnQfin 

Aiogouyuu 


? f51 


1237 


1238 


60S ribosomal protein L2 


7E-48 


Ml^y 1 0\}cX} 


1 '58 


1239 


1240 


unknown protein 


0 


At1g23030 


1.98 


1241 


1242 


zinc finger protein, putative 


0 


At1g34370 


1.51 


1243 


1244 


putative protein 


3E-8 


At4g05150 


1.38 


1245 


1246 


aldose 1-epimerase - like protein 


5E-25 


At3g47800 


1.88 


1247 


1248 


dnnamoyl-CoA reductase - like protein 


0 


At5g58490 


1.35 


1249 


1250 


putative NADP-dependent glyceraldehyde-3-phosphate 

dehydrogenase 


0 


At2g24270 


1.43 




d OCT 


isp4 like protein 


0 


At4g 16370 


1.77 


1253 


1254 


putative protein 


0 


At4g08350 


1.32 


1255 


1256 


calmodulin-related protein 2, touch-induced (TCH2) 


0 


At5g37770 


1.55 


1257 


1258 


20S proteasome subunit PAD2 (gb!AAC32059.1) 


0 


At5g66140 


1.5 


1259 


1260 


glucosidase II alpha subunit 


0 


At5g63840 


1.35 


1261 


1262 


putative GAR1 protein 


0 


At3gO3920 


1.74 



95 



wo 2004/035798 



PCT/EP2003/011658 




2729 


2730 


GATA Zn-finger protein 






2.75 


2731 


2/32 


unknown protein 




At5g53740 


2.01 


2733 


2734 


putative glucosyltransferase 




At2g15480 


2.15 


2735 


2736 


Antliocyaninless2 




At4g00730 


2.73 


2737 


2738 


pectate lyase-lil<e protein 




At3g54920 


2.13 


2739 


2740 


putative sterol dehydrogenase 




At2g43420 


2.10 



*** This accession number was replaced by a new annotation and called At1g36160 
Table 5: Arabidopsis genes 1.3 times (1/ratio) or more repressed in E2Fa/Dpa plants 



SEQI 


DNO 


Gene name 


E-value 


MIPS accession Number 


Ratio 


cDNA 


PROT 










1349 


1350 


putative glutathione peroxidase 


0 


Af9n^1'57n 


n CI 
U.01 


1351 


1352 


phenylalanine ammonia lyase (PAL1) 


0 




U.bO 


1353 


1354 


unknown protein 


0 






1355 


1356 


putative protein 


0 






1357 


1358 


water channel - like protein 


1 Lb I £0 




n 7 
U.I 


1359 


1360 


catalase 


0 




U.4b 


1361 


1362 


stearoyl-ACP desaturase 


2E-11 




n Kyi 

U.04 


1363 


1364 


putative oligopeptide transporter 


0 


At4fi 10770 


u.o/ 


1365 


1366 


putative chloropiast SOS ribosomal protein L28 


n 


MlcyOOf OU 


\j.l6 


1367 


1368 


fen'edoxin-NADP reductase precursor, putative 




AHnonnon 

MllyjiUU^U 


0.04 


1369 


1370 


3-beta-hydroxysteroid dehydrogenase 






\J.li 


1371 


1372 


putative alanine aminotransferase 




ML 1 y / UuOu 


U.O 1 


1373 


1374 


hypotnetical protein 


*Tt OCT 


Af'In'iR'^nn 

ML lyJOOUU 


r\ eta 
U.Db 


1375 


1376 


putative protein 


u 




U.b4 


1377 


1378 


putative protein 


1E-158 


At'in?fiQ7n 


C\ 7 


1379 


1380 


actin depolymerizing factor 4 - like protein 


0 


AWn'iQRQn 

/AWyji703U 


u.bb 


1381 


1382 


hypothetical protein 


7F-79 


nljy4D 1 DU 


U.O 


1383 


1384 


transporter-like protein 




At'^n'i'^QRn 

MlOyjO!7DU 


U.bO 


1385 


1386 


nicotianamine synthase (dbj|BAA74589.1) 


n 




U.JD 


1387 


1388 


cytochrome P450 monooxygenase (CYP83A1) 


n 




A QO 


1389 


1390 


unknown protein 






\J.77 


1391 


1392 


hypothetical protein 




nug 1 <^oou 


A Ct^ 


1393 


1394 


unknown protein 


0 


At5g64130 


0.52 


1395 


1396 


putative protein 


0 


At3g61870 


0.73 


1397 


1398 


fructose-bisphosphate aldolase - like protein 


0 


At4g26530 


D.17 


1399 


1400 


ectin like protein 


1E-124 


At4g19840- 


D.74 


1401 


1402 


unknown protein 


0 


At1g28140 


D.72 


1403 


1404 1 


eebly-like protein 


D 


At3901420 


D.73 


1405 


1406 


Deta-fructosidase 


1E-105 


At1g62660 


3.38 


1407 


1408 


Jnknown protein 


lE-06 


At1g15350 


}.77 


1409 ' 


410 f 


jeptidylprolyl isomerase R0C1 ( 


3 


At4g38740 


3.76 


1411 • 


412 I 


lypothetical protein 


IE-36 


At2g06010 ( 


3.74 


1413 1 


414 f 


lutative protein 


E-114 


At4g30490 ( 


}.5 


1415 1 


416 : 


i-isopropylmalate dehydrogenase ( 


) 


At5g14200 


).61 


1417 1 


418 F 


)utalive copper/zinc superoxide dismutase 1 


E-93 


At2g28190 


3.77 


1419 1 


420 F 


)utative myo-inositol 1-phosphate synthase C 


) 


At2g22240 ( 


).68 


1421 1 


422 F 


utaBve enolase (2-phospho-D-glycerate hydroylase) 


) 


ftt2g29560 


).7 


1423 1 


424 L 


nknown protein C 


) 


At5g43750 


).4 


1425 1 


426 p 


utative protein 1 


E-?? 


^t4g32330 


).68 
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1427 



1428 



1429 



1430 



1431 



1433 



1435 



putative ferredoxin-thioredoxin reductase 



1432 



1434 



hypothetical protein 



1E+00 



putative cellulose synthase 



putative protein 



1436 



1437 



1439 



1441 



1443 
1445 



putative protein 



1438 



1440 



1442 



1444 



PCT/EP2003/011658 



At2g04700_ 



Attg23290_ 



hypothetical protein 



unknown protein 



hypothetical protein 



hypothetical protein 



1447 



1449 
1451 



1453 



1455 



1457 



1459 



1461 



putative protein 



unknov>fn protein 



1458 
1460 



protein phosphatase ABI1 



1462 



1463 



1455 



1457 



1464 



1466 



1468 



piUlglll HMUJK"" — ,_^j.»> . 

33 kDa polypeptide of oxygen-evolving complex (OEC) >n 
photosystem 



lE-115 



beta-xylosidase 



GDP-mannose pyrophosphorylase 



1E-163 



peroxidase ATP20a (emb|CAA6/a3B.ii 



putative glutathione transferase 



1469 



1470 



1471 



1472 



putative protein 



At5g66570 



At5g64570 



At2g39770 



At5g14130 



At1g17190 



unknown protein 



SOS ribosomal protein L24. chloropiast precur su^ 



1E-179 
0 




At4g38080 



At1g61190 



At5g54600 



At1g68260 



" At3g14067 



1491 



1493 



1492 



1495 



1497 



1494 



putative protein — 

ribulose bisphosphate cartwxylase ifi^iilchiin 3b precursor 

(RuBisCO 



Myb DNA binding protein -like 



1496 putative 2-cys pero)qredoxiri 



1498 putative trypsin inhibitor 



1499 



1501 



1500 Omethyltransf erase 



At4g23890 



At3g01690 



At3956290 
At2939450 



At5g64130 



1E-145 



At4g301l40_ 



At5g38410 



At3g46130 



1E-127 



1502 jhypothetical protein 



1503 



1504 



1505 



1507 
1509 



1510 



1511 



1513 



1515 



1517 



1519 



1521 



1523 



1525 



1514 



At3g11630 



At1g73260 



At5g54160 



At1g29270 



1516 



1518 



1520 



1522 



0.75 



0.59 




0.55 



0.62 



0.71 



0.75 



0.7 



0.76 



0.55 



10762 



10.59 



10.7 



l0.3 



0.67 
0.66 



0.54 



0.54 



0.75 



0.64 



0.59 



0.62 




jmyosin heavy chain-like protein 
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1 \J£.I 




uiii\fiuwn protein 


u 


Anguyb lU 


0.76 






aldUinUgolowlan prOIcIn - IIKc 


A 

u 


A jCcAO'i '7r\ 

Atoguol (yj 


0.71 


1*1^1 

UO 1 


1 ooz. 


biotin carboxyl carrier protein of acetyl-CoA carboxylase 
precursor 


A 
U 


AiCn't conn 


0.69 


1 jOO 




centrin 


A 
U 


At3g50360 


0.74 


IC300 


lOOD 


vegetstive storage protein Vspl 


A 

0 


At5g24780 


0.48 


100/ 


\ooo 


protein kinase, putative 


1c-61 


A ill . cno A A 

At1g523lQ 


0.63 


loos 


1 0*fU 


uriKnown protein 


1E-132 


A iA^ A A 

At2g42760 


0.63 


I Of 1 


1 0*f^ 


pnenyiaianine ammonia lyase {PALI ) 


0 


A4A— A^A ^ A 

At2g37040 


0.72 


1543 


1544 


UDP rtiamnose-anthocyanidin-S-glucoside rhamnosyltransferase 
- like 


0 


At4g27560 


0.45 


lOf 0 


1 Of 0 


unKnown protein 


0 


At2g 17500 


0.54 


IOh/ 


1 o*to 


NRL> aomain protein, putative 


0 


AtW — A*4^AA 

At1g01720 


0.72 




1 oou 


ubiquitin-conjugating enzyme-lii<e protein 


or* 

2E-24 


A IC ^Cf> A CA 

At5g56150 


0.41 


1001 


103^ 


putative RNA-binding protein 


1E-136 


At2g37220 


0.72 


lOOJ 


I004 


uveriap witn oases o7,142-90,4i:5 of IGr BAG clone F9K2u, 
accession 


0 


At1g78570 


0.52 


1555 


1556 


hypottietical protein 


1E-105 


At2a04040 




1557 


1558 


Isp4-like protein 


4E-01 






1559 


1560 


ids4-like protein 


0 


At5a20150 


0 58 


1561 


1562 


unknown protein 


3E-98 




n R7 


1563 


1564 


R2R3-MYB transcription fiactor 


0 


At3a50060 


n fiR 


1565 


1566 


)utative hexose transporter 


0 


At4an9n^n 


u.uo 


1567 


1568 


one lielix protein (OHP) 




AfinflPI 90 


n "17 

U.J/ 


1569 


1570 


UDP-olticosH dfihvdrnof»na<?p-likp nrntpin 


n 

u 


MlOy 1 O^ou 


n 74 


1571 


1572 


Dutative orotein 


n 


rVloyw'r^wU 


n fi"? 

U.OO 


1573 


1574 


}utative L5 ribosomal nrotpin 


n 

u 


rM*ry u 1 o 1 w 




1575 


1576 


jutative myosin tieavy ctiain 






f) R1 


1577 


1578 


cIdB iieat shock orotein-like 


0 




n '57 


1579 


1580 


unknown protein 


4E-71 




Ofifi 

'.Uv/ 


1581 


1582 


Deta-fructosidase, putative 


0 

V 


At1o12240 




1583 


1584 


riypottietical protein 


0 


At1 047670 




1585 


1586 


outative protein 




At'in9'58Qn 


n 7*; 

J.I \j 


1587 


1588 


predicted protein 


1E-108 

1 L. 1 wU 


At4n'^1'^QD 




1589 


1590 


lutative phosptiolipase 


n 


ru^y u S7H^ V 




1591 


1592 


ATP-deoendent transmsmbrane tran<!nortpr niitativp 


0 


At1n*S14Rn 
r\l 1 y «J 1 tou 


n 74 

J. / H 


1593 


1594 


H+-transDOrtina ATP svnthase-iilce nrotein 


0 

J 




n R4 


1595 


1596 
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0 


At4g1 1 500 


A 7*1 

U.f 1 






acuuiit;(jiuiube-oispnospna[aSB piccursor 


ft 


Aiogyoouu 




2029 


2030 


CIp proteinase like protein 


2E-55 


At4g 17040 


0.75 


2031 


2032 


unknown protein 


0 


At5g07020 


0.68 


2033 


2034 


unknown protein 


2E-32 


At5g51720 


0.49 


2035 


2036 


endomembrane protein, putative 


1E-117 


At1g14670 


0.75 


2037 


2038 


putative phosphomannomutase 


0 


At2g45790 


0.66 


2039 


2040 


putative protein 


1E-95 


At4g27280 


0.46 
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2041 



2042 mrp protein, putative 



2043 



2044 [putative vacuolar ATPase 



2045 



2046 phosphate transporter, putative 



2047 



2048 Isimilar to Trp Asp repeat protein emb|CAB39845.1 ; simiiaiTo EST 



2049 



2050 Iputative IVIAP kinase 



2051 



2053 



2052 jethylene-responsive franscriptional coactlvator, putative 
2054 S-phosphogluconolactonase-lilce protein 



2055 



2056 [beta-amylase-like proten 



2057 



2058 [hypothetical protein 



2059 



2060 jchloroplast SOS ribosomal protein L31. putative 



2061 



2062 Icytochrome P450-like protein 



2063 



2065 



2067 



2069 



2071 



2073 



2075 



2077 



2079 



2081 



2083 



2085 



2E-18 



0 



1E-175 



3E-53 



At3g24430 



At4g02620 



At3g26570 



At1g78070 



At2g01450 



At3g24500 



At5g24420 



At5g18670 



2064 Iputative potassium transporter AtKTSp (AtKTS) 



20'66 [putative ribosomal-protein S6 kinase (ATPK6) 



2068 jhypotheticai protein 



2070 [transcription factor HapSa 



2072 [putative protein 



2074 [beta-expansin 



2076 |SQUL-like protein 



2078 [unknown protein 



At1g20970 



At1g75350 



At4g37320 



At4g33530 



At3g08730 



6E-74 



4E-82 



2080 [unknown protein 



1E-124 



2082 [unknown protein 



1E-160 



2084 [unknown protein 



2087 



2086 Ia DPG pyrophosphorylase small subunit (gb|AAC39441.1) 
2088 [unknown protein" — — 



2089 



2090 Ihypothetical protein 



2091 



2092 [unknown protein 



2093 



2095 



2097 



2099 



2101 



At1g0477Q 



At3g48590 



At5g20070 



At2g20750 



AtlglTlOO 



At1g70760 



At2g20890 



At1g07280 



At1g64680 



At5g48300 



At2g17340 



At1g268Q0 



At1g22930 



2094 polyphosphoinositicle binding protein, putative 



2096 caffeoyl-CoA 0-mettiyltransferase - like protein 



2098 [pectinesterase 



2100 [unknown protein 



2102 |p-nitrophenylphosphatase-like protein 



7E-75 



0 



2103 



2104 (putative protein 



2105 



2106 [serine/threonine protein kinase protein 



2107 



2108 Icytosoiic factor, putative 



2109 



2110 S-aclenosylmethionine:2-demethylmenaquinone 
methyltransferase-like . 



2111 



2112 [pectate lyase 



2113 



21 1 4 [vacuolar sorting receptor-like protein 



2115 



21 1 6 Iputative membrane ctiannel protein 



2117 



2119 



2121 



2123 



21 1 8 iputative thylakoid lumen rotamase 



21 20 [putative chloroplast prephenate dehydratase 



2125 



2127 



2129 



2131 



2133 



2135 



2137 



2139 



2141 



1E-172 



1E-159 



2122 |3-oxoacyl-[acyi-carrier-protein] synthase I precursor 



2124 |P-Protein - like protein 



2126 [NHE1 Na+/H+ exchanger 



2128 



receptor kinase-like protein 



2130 



raffinose synthase -like protein 



2132 



thylakoid luminal protein 



2134 



unknown protein 



2136 



2138 



2140 



2142 



At1g01630 



At4g34050 



At5g53370 



At1g64370 



At5g36790 



At5g55960 



At5g10930 



At1g72160 



At5g56260 



At5g63180 



At4g20110 



At2g28900 



At3g01480 



1E-108 



1E-122 



0 



At3g44720 



Al5g46290 



At4g33010 



At5927150 



At3g47580 



At5g40390 



beta-xylan endohydrolase -like protein 



putative protein 



9E-02 
1E-137 



putative ribose 5-phosphate isomerase 



10 



putative protein 



At1g54780 



At2g46170 



At4g33810 



At4g12700 



At3g04790 



At5g47840 



075 



074 



0.61 



0.74 



0.51 



0.52 



0.4 



072 



0.74 



0.67 



0.76 



0.63 



0.68 



0.6 



0.69 



0.72 



0.71 



0.4 



0.73 



0.72 
0.65 



0.68 



0.61 



0.74 



0.67 



0.72 



0.67 



0.56 



0.43 



0.52 



0.64 



0.26 



0.67 



0.76 



0.67 



07 



076 



0.56 



073 



0.76 



0.73 



073 



0.72 



0.59 



0.63 



0.73 



0.26 



0.6 



0.76 



0.7 
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2143 


2144 


putative RNA-binding protein 


0 


Atl 909340 


0.57 


2145 


2146 


adenine phosphoribosyltransferase (EC 2.4.2.7) - like protein 


0 


At4g22570 


0.46 


2147 


2148 


unknown protein 


0 


At3g15950 


0.37 


2149 


2150 


putative glutathione peroxidase 


7E-12 


At2g25080 


0.46 


2151 


2152 


putative protein 


0 


At5g23060 


0.63 


2153 


2154 


pectate lyase l-like protein 


0 


At1g67750 


0.42 


2155 


2156 


putative triosephosptiate isomerase 


9E-61 


At2g21170 


0.66 


2157 . 


2158 


carbonate detiydratase - like protein 


0 


At4g33580 


0.72 


2159 


2160 


putative protein 


0 


At5g37300 


0.56 


2161 


2162 


putative protein 


1E-143 


At3g60080 


0.77 


2163 


2164 


cystatin (emblCAA03929.1) 


2E-83 


At5g12140 


0.74 


2165 


2166 


putative cytoctirome b5 


0 


At2g46650 


0.46 


2167 


2168 


putaive DNA-binding protein 


2E-08 


At4g31550 


0.63 


2169 


2170 


tiypothetical protein 


lE-143 


At3g21050 


0.5 


2171 


2172 


putative beta-hydroxyacyl-ACP detiydratase 


0 


At2g22230 


0.59 


2173 


2174 


2-oxoglutarate/malate translocator 


0 


At5g64290 


0.77 


2175 


2176 


tiypottietical protein 


1E-123 


At3g27050 


0.49 


2177 


2178 


putative alcohol detiydrogenase 


9E-64 


At2g37770 


0.64 


2179 


2180 


hypothetical protein 


1E-107 


At1g18730 


0.67 


2181 


2182 


putative pectinacetylesterase 


0 


At4g19420 


0.71 


2183 


2184 


similar to ADP-ribosylation factor gb|AAD17207; similar to ESTs 


2E-80 


At1g 10630 


0.67 


2185 


2186 


hypothetical protein 


0 


At1g04420 


0.67 


2187 


2188 


putative protein 


0 


At4g26710 


0.62 


2189 


2190 


putative protein 


0 


At4g34630 


0.72 


2191 


2192 


latex protein 


0 


At1g70890 


0.29 


2193 


2194 


RCc3- like protein 


0 


At4g22490 


0.57 


2195 


2196 


hypothetical protein 


5E-53 


At1g20450 


0.49 


2197 


2198 


glucosyltransferase-like protein 


3E-31 


At5g22740 


0.65 


2199 


2200 


glutathione S-transferase 


0 


At2g29450 


0.52 


2201 


2202 


putative protein 


0 


At3g44450 


0.59 


2203 


2204 


cysteine synthase 


0 


At5g28020 


0.6 


2205 


2206 


ATP synthase 


0 


At4g04640 


0.57 


2207 


2208 


40S ribosomal protein S14 


1E-25 


At2g36160 


0.67 


2209 


2210 


putative protein 


0 


At4g19100 


0.76 


2211 


2212 


K Efflux antiporterKEAl 


0 


At1g01790 


0.65 


2213 


2214 


hypothetical protein 


1E-169 


At2g42980 


0.66 


2215 


2216 


cytochrome P450 like protein 


1E-01 


At4g36380 


0.48 


2217 


2218 


unknown protein 


8E-64 


At2g01520 


0.23 


2219 


2220 


hypothetical protein 


1E-157 


At1g07130 


0.66 


2221 


2222 


putative protein 


5E-04 


At5g09620 


0.62 


2223 


2224 


unknown protein 


0 


At1g08470 


0.66 


2225 


2226 


putative protein 


6E-37 


At3g54600 


0.7 


2227 


2228 


DnaJ - like protein 


1E-68 


At4g39960 


0.52 


2229 


2230 


putative protein phosphatase 2C 


1E-151 


At1g78200 


0.72 


2231 


2232 


biotin synthase (Bio B) 


0 


At2g43360 


0.67 


2233 


2234 


unknown protein 


3E-69 


At3g17510 


0.55 


2235 


2236 


high mobility group protein 2-like 


1E-107 


At3g51880 


0.65 


2237 


2238 


putative proline-rich protein 


0 


At2g2l140 


0.57 


2239 


2240 


cyclin delta-3 


0 


At4g34160 


0.74 


2241 


2242 


serine carboxypeptidase li - like protein 


0 


At4g30810 


0.77 


2243 


2244 


unknown protein 


0 


At1g67330 


0.7 


2245 


2246 


putative protein 


7E-93 


At3g56010 


0.7 
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2247 



2249 



2251 



2248 |GTP-binding protein LepA homolog 



0 |At5g08650 



PCT/EP2003/On658 



2250 



unknown protein 



2252 



2253 



2254 



2255 



2256 



2257 



2259 



2261 



2263 



putative protein 



2258 



putative protein 



putative enoiase 



putative protein 



2260 



putative protein 



2262 



0-methyltransferase, putative 



2264 



beta-1,3-9lucanase class I pfscursor 



2265 



2266 



2267 



2268 



2269 



2271 



protein ptiosptiatase 2C (PP2C) 



root cap protein 2-lii<e protein 



2270 



putative adenosine phosphosulfate kinase 



2272 



2273 



2274 



2275 



2276 



2277 



2278 



2279 



2280 



2281 



2282 



2283 



2284 



2285 
2287 



2286 



2288 



2289 



2290 



2291 



2293 



2295 



2297 



2299 



2301 



2303 



2305 



2307 



2309 



putative protein 



putative uroporptiyrinogen decarix>xylase 



putative protein 



branclied-ctiain amino add aminotransferase, putative 



copia-lilte retroeiement pol polyprotein 



neoxanttiin cleavage enzyme-lilce protein 



tiypottieticai protein 



2292 



2294 



2296 



2298 



2300 



2302 



2304 



2306 



2308 



2310 



2311 



2312 



2313 



2314 



2315 
2317 



2319 



2316 



i3E-05 



At3g10420 



At3g51510 



At3g45870 



At1g74030 



At5g11680 



At5g26280 



2E-27 



At1g21100 



At4g16260 



1E-174 



2E-9 



1E-151 



unitnown protein 



asparagine synttietase ASN3 



tiypcttietical protein 



E-56 



At3g 11410 



At5g54370 



At2g14750 



At4g30010 



At2g40490_ 
At3g57400 



0 



At3g19710 



At2g19830 



At4g19170 



At1g31860 



At2g26570 



expansin S2 precursor, putative 



5'-adenyiylsulfate reductase 



0 



putative protein 



9E-02 



putative IVIYB family transcription factor 



4E-31 



Putative protein l(inase 



3E-23 



putative protein 



AALP protein 



putative galactinol syntliase 



cvanotiydrin lyase lilce protein 



putative protein 



putative glucosyltransferase 



cysteine endopeptidase precursor - like protein 



unknown protein 



2318 



2320 



2321 



2323 



2325 



2322 



2327 



2324 



putati ve peroxiredoxin 
iVlAPKK 



E-80 



E-114 



At5g 10240 



At1g64770 



At1g20190 



At4g04610 



At3g59680 



At2g37630 



At1g51850 



At5g15910 



At5g60360 
At2g47180 



At4g16690 



At5g038B0 



1E-122 



2E-86 



tiypottietical protein 



7E-74 



2326 



UDP glucose:(lavonoid 3-o-giucosvltransferase, putative 



2E-90 



tiypottietical protein 



2328 



2329 



2330 



2331 



2332 



2333 



2335 



2337 



2334 



2336 



2339 



2341 



2343 



2345 



2347 



oxidoreductase -like protein 



hypottietical protein 

carnitine/acylcamifine translocase-like protein 



MATH protein 



unknown protein 



2338 



2340 



cytochrome P450-like protein 



putative endo-1,4-beta glucanase 



2342 |hevein-like protein precursor 



At2g30150 



At3g48350 



At3g07700 



At3g2606D 



At1g73500 



Atl 964780 



At1g30530 



At4902800 



At395529Q 



At1g50670 



At5g46800 



1E-169 



0 



8E-36 



OE+00 



2344 jleudne zipper-containing proteinATI 03 



1E-139 



2346 |delta-1-pyrroline-5-carboxylate synthetase 



0 



2348 remorin 



2349 12350 |putative protein 



At4g00780 



At1g22630 



At4g37330 



At4g02290 



At3g04720 



At3g56940 



At3g55610 



At2g45820 



At5g22460 
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0.42 



0.58 



0.65 



0.71 



0.58 



0.63 



0.51 



0.67 



0.75 



0.47 



0.73 



0.75 



0.71 



0.3 



0.72 



0.38 



0.7 



0.61 



0.72 



0.51 



1043 



0.71 



10.73 



0.6 



0.76 



0.63 



0.69 



0.56 



0.57 



0.73 



0.65 



0.7 



0.76 



0.58 
0.52 



0.59 



0.55 



0.65 



0.73 



0.58 



0.57 



0.76 



0.72 



0.62 



0.75 



0.63 



0.69 



0.76 
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2351 
2353 



2357 



2359 



2361 



2363 



2365 
2367 



2352 



putative lectin 



2354 
2356 
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putative protein 



2358 



2360 



2362 



2364 



2369 



2371 



2373 



2375 



2377 



2379 



2381 



2383 



2366 



2368 



2370 
2372 



2374 



2376 



2378 



2380 



2382 



2385 



2387 



2389 



2391 
2393 



2395 



2397 



2384 



putative cell division control protein, cdc2 kinase 



2386 



2388 
2390 



2392 



2394 



2396 



2399 



2401 



2405 



2407 



2409 



2411 



2413 



2415 



2417 



2419 



2398 



2400 



2402 



2404 



2406 



2408 



2410 



2412 



2414 



2416 



peptidylprolyl isomerase R0C4 



0-metliyltransferase, putative 



putative zinc finger protein 



[At3g 16530 



9E-29 
0 



0 



putative tiydroxyproline-ricti glycoprotein 



0 



putative protein 1 photosystem 11 oxygen-evolving complex 



hypothetical protein 



1E-173 



unl<nown protein 



tieat stiocl< protein 90 



Overlap vifitti bases 87,142-90,425 of 'IGF BAG clone F9K20 
accession 



phosptioglycerate kinase, putative 



1E-115 



At5g26260 



At3g62030 



At1g21130 



At4g38960 



At1g13930 



At3g50820 



At1g66700 



At1g52870 



At5g56010 



putative lectin 



profilin 2 



HSP associated protein lil<e 



5E-16 



putative protein 



1E-75 



ribosomal protein S27 



vacuolar H+-transporting ATPase 16K ctiain 



expansin At-EXP5 



similar to cold acclimation protein WCOR413 fTriticum aestivum] 



E-120 



At1g78570 



IE-25 



1E-107 



3E-82 



At3g12780 



At3g16400 



At4g29350 



At4g22670 



At1g20930 



At5g08050 



Af5g47930 



At4g34720 



ciiloroplast membrane protein (ALBINOS) 



putative ttiioredoxin 



unknown protein 



hypofteticalprotein 



putative flavonol sullbtransferase 



possible apospory-associated like protein 



glycolate oxidase, putative 
jutative peroxidase ATP2a 



1E-159 



1E-102 



putative protein 



tiydroxypyruvate reductase (HPR) 



ptiotosystem I reaction centre subunit psaN precursor (PSI-N) 



2418 



2421 



2425 



2420 



2422 



2424 



2427 



2426 



2428 



plastid ribosomal protein S6, putative 



metiiylenetetrahydrofolate reductase MTHFR1 



putative piiotosystem I reaction center subunit II precursor 



unknown protein 



fumarate iiydratase 



Lil3 protein 



Tomeobox gene ATH1 



At3g29030 



At2g 15970 



At2g28800 



At1g08570 



At1g08380 



At1g07180 



At1g74090 



At4g25900 



At3g14420 



IE-154 



0 



0 



At2g37130 



At4g21860 



At1g68010 



At5g64040 



At1g64510 



At3g59970 



At1gO3130 



At3g10940 



At5g50950 



At5g47110 



At4g32980 



0.58 
0.65 



0.63 



0.43 



0.52 
0.61 



0.63 
0.72 



0.43 
0.75 



0.63 



73 



0.4 
0.77 



0.75 



0.72 



0,65 



0.65 



0.53 



0.69 



77 



0.76 



0.52 



0.74 



0.72 



0.55 



0.71 



0.71 



0.75 



0.75 



0.74 



0.49 



.72 



0.55 



0.64 
0.43 



0.73 



putative lectin 



2431 



2432 



2433 



2434 



C0P1-interacting protein 7 (CIP7) 



3E-20 



AtSg 16390 



2435 



2£- 

2- 



2436 
2438 



putative acyi-CoA synthetase 



E-07 



At4g27430 



unknown protein 



At2g47240 



hydroxymethyltransferase 



At2g01590 



2^^1 



2440 hypothetical protein 



2443 



2442 SNF1 related protein kinase (ATSRPK1 ) 



E-164 



2444 



2445 



2446 



2447 



mevalonate diphosphate decarbox ylase 
Iputative flavonol sulfotransferase 



E-170 



E-68 



2449 
2451 



2448 [protein phosphatase 2C (AtP2C-HAr 



2450 



2452 



cinnamoyl-CoA reductase - like protein 
0-methyltra nsferase - like protein 



At4g 13930 



At1g69490 



At3g23000 



At2g38700 



At1 974090 



At1g72770 



At4g30470 



At4g35160 



0.43 



0.67 



0.72 



0.51 



0.68 



0.27 
0.49 



0.71 
0.69 



0.59 



0.72 
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2453 



2454 



2455 



2456 



2457 



2458 



2459 



2461 



2463 



2465 



pyruvate dehydrogenase E1 alpha subunit 



putative chlorophyll A-B binding protein 



2460 



2462 



putative UDP-N-acetylglucosamine pyrophosphorylase 



putative protein 



Ca2+-dependent membrane-binding protein annexin 



2464 



hypothetical protein 



2466 



2467 



2468 



2469 



2470 



2471 



2473 



2475 



2477 



2472 



2474 



2476 



2478 



hypothetical protein 

h ypothetical protein . 

aromatic amino-acid decarboxylase - like protein 



unknovtfn protein 



hypothetical protein 



putative elongation factor P (EF-P) 



2479 



2480 



2481 



2482 



2483 



2485 



2487 



2489 



unknovm protein 



putative protein 



unknovtfn protein 



2484 



aldo/keto reductase-like protein 



2486 



qiycine decarboxylase complex H-protein 



2488 



thioredoxin (clone G1F1) (pirHS58118) 



2490 



2491 



2493 
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CLAIMS 

1 A method to alter one or more plant characteristics, said method comprising modifying, in a 
plant, expression of one or more nucleic acids and/or modifying level and/or activity of one or 
more proteins, v^hich nucleic acids and/or proteins are essentially similar to any one of SEQ ID 
NO 1 to 2755. and wherein said one or more plant characteristics are altered relative to 
conresponding wild type plants. 

2 A method according to claim 1. wherein said altered plant characteristic is selected from 
any one or more of the following: altered development, altered growth, increased yield and/or 
biomass. enhanced survival capacity, enhanced stress tolerance, altered plant architecture, 
altered plant physiology, altered plant biochemistry, altered metabolism, altered DNA 
synthesis, altered DNA modification, altered endoreduplication. altered cell cycle, altered cell 
wall biogenesis, altered transcription regulation, altered signal transduction, altered storage 
lipid mobilization and/or altered photosynthesis, each relative to corresponding wild type 
plants. 

3. A method according to claim 2. wherein said altered metabolism comprises altered 
nitrogen and/or altered carbon metabolism. 

4. A method according to claim 2. wherein said increased yield and/or biomass. comprises 
increased seed yield. 

5. A recombinant nucleic acid comprising: 

(a) one or more nucleic acid sequences essentially similar to any one of SEQ ID NO 1 
to 2755 or the complement strand thereof; optionally operably linked to 

(b) a regulatory sequence, and optionally operably linked to 

(c) a transcription termination sequence 

6. A recombinant nucleic acid according to claim 5. wherein said regulatory sequence is a 
plant-expressible promoter. 

- 7. A recombinant nucleic acid according to claims 6. wherein said plant-expressible promoter 
is any one of the promoters listed in Table I. II, III or IV. 
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8. A method for making a transgenic plant or plant cell having one or more altered plant 
characteristics when compared to the corresponding wild-type characteristics, said method 
comprising introduction of a recombinant nucleic acid according to claim 5, 6 or 7 into said 
plant or plant cell. 

5 

9. A method according to claim 8, wherein said recombinant nucleic acid is stably integrated 
into the genome of said plant . 

10. A method according to any of claims 1 to 4 or 8 or 9, comprising overexpression of one or 
10 more nucleic acids essentially similar to any one of SEQ ID NO 1 to 2755. 

11. A method according to any of claims 1 to 4 or 8 or 9, comprising downregulation of 
expression of one or more nucleic acids essentially similar to any one of SEQ ID NO 1 to 2755. 

15 12. A transgenic plant having one or more altered characteristics when compared to the 
conresponding wild-type plant, characterized in that said plant has modified expression of one 
or more nucleic acids and/or modified level and/or activity of one or more proteins, said nucleic 
acid and/or protein being essentially similar to any one of SEQ ID NO 1 to 2755. 

20 13. A transgenic plant obtainable by a method according to any of claims 1 to 4 or 8 to 1 1. 

14. A transgenic plant comprising an isolated nucleic acid and/or protein sequence essentially 
similar to any one of SEQ ID NO 1 to 2755. 

25 15. An ancestor, progeny, or any plant part, particularly a harvestable part, of a transgenic 
plant of claim 12 or 14. 

16. A host cell having one or more altered characteristics when compared to the corresponding 
wild-type host cell, characterized in that said host cell has modified expression of one or more 
30 nucleic acids and/or modified level and/or activity of one or more proteins, said nucleic acid 
and/or protein being essentially similar to any one of SEQ ID NO 1 to 2755. 

: 17. Use of a nucleic acid sequence or protein essentially similar to any one of SEQ ID NO 1 to 
2755, for altering one or more plant characteristics. 

35 

18. A method for the production of plants with one or more altered characteristics when 

121 



PCT/EP2003/011658 

WO 2004/035798 

compared to the corresponding wild-type plants, which method comprises the use of a nucleic 
acid sequence essentially similar to any of SEQ ID NO 1 to 2755 in marker assisted breeding. 

19 A method for the production of plants with one or more altered characteristics when 
5 compared to the corresponding wild-type plants, which method comprises the use of a nucleic 
acid sequence essentially similar to any of SEQ ID NO 1 to 2755 in conventional breeding. 

20. A plant obtainable by the methods according to claim 18 or 20. 

10 21. Use of a nucleic acid and/or a protein essentially similar to any one of SEQ ID NO 1 to 
2755 as a growth regulator. 

22. A growth regulating composition comprising a nucleic acid and/or a protein essentially 
similar to any one of SEQ ID NO 1 to 2755. 

23. A method for the production of a growth regulator, comprising the production of a nucleic 
acid and/or a protein essentially similar to any one of SEQ ID NO 1 to 2755. 

24 Method for the production of enzymes and/or pharmaceuticals, which method comprises 
20 modifying expression of a nucleic acid, and/or modifying level and/or activity of a protein, sa.d 
nucleic acid and/or protein being essentially similar to any one of SEQ ID NO 1 to 2755. 

25. Use of plants according to claim 11 to 14, for the production of enzymes and/or 
pharmaceuticals. 

25 

26. Enzymes and pharmaceuticals produced according to the method of claim 24. 

27. Use of a nucleic acid or a protein essentially similar to any one of SEQ ID NO 1 to 2755, as 
a therapeutic agent, a diagnostic means, a kit or plant effective agent. 

30 . 

28. A therapeutic composition comprising a nucleic acid and/or a protein essentially similar to 

any one of SEQ ID NO 1 to 2755. 

29. A diagnostic composition comprising a nucleic acid and/or a protein essentially similar to 
35 any one of SEQ ID NO 1 to 2755. 
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33. A food prod.0. danved from a p,an. or host o^l accoMing ,o any one of cdaim 12 ,o 16. 

34. Use Of a food product according to clair. 33 in animal teed or food. 

--orpr.in:r;;:erar^^^^^^ 

36^ Use Of a nucleic acid or a protein essentially simlar to any one of SEQ ,D NO 1 to 2755 as 

aon or any of the nucleic acids represented by SEQ ID NO 1 to 2755. 
promoter of any one of said nucleic acids represented by SEQ ID NO 1 to 2755. 
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MATDB - entry At1g57680 from contig 18123 

(Chromosomo 1 / BAC clone T8L23 / sequonCB datatuso accession EMBL:AC079T331 



Type: gene/protein 

Code: At1g57680 
* Old code : T8L23_15 

Title: - -tative protein 

Contig: .3 

Position : 32-54480 (C) 

Notes 



Classifica!. 

• Known protein 
Functional Category 

• UNCLASSIFIED PROTEINS 
TarqetP prediction 

• Targeted to secretory pathway 

• TargetP score: 0.968 

• TargetP reliability dass: 2 

• Probable signal sequence length: - 

TIWHMM transmembrane prediction 

• Very likely to be a transmembrane protein (or have a signal peptide) (Exp 
number of AA in TMHs: 110) 

• A transmembrane region could actually be a signal peptide (Exp number, first 
60 AAs: 21) 

• Orientation of N-termina!: external side (probability: 0.9) 

• Transmembrane regions- 

• 40-62 

• 83-100 

• 138-160 

• 181-203 

• 213-235 

EMBL 

• AY072149 



mRNA matches: 1 found 
Arabidopsis ESTs 

• found 10 

AA585779 : Ai992654 : AI998042 : AV518701 : AV538415 : AV538372 : 
AV541088 : AV550688 : AV550640: AV554579 : 
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Full report 

• Full report induiJes FST matches and external annotation- stow. 
Protein properties 

PEDANT and Interpro data are being recalculated. To access old PEDANT data, use the link in the 
fen frame b2t be aware that some protein sequences have been changed due to update of gene 
models based on cDNA data and PEDANT data may be outdated. 



Click here to submit new infonnaUon about this entry 



10/18/2002 956 AM 

2 0(2 



FIGURE 4 (contin.) 



wo 2004/035798 



PCTyEP2003/0n658 



6/10 



• „^.„ http://mlp5.gsf.de/cgI-l)ln;pn>J/lhal/9el_pep7t8l23AAt1g57680 

A.ttiaIiana-con«gtBt23-enliyAt1g57680 nnpj/m.p a 

A. thaliana - contig t8I23 - entry Atl g576B0 ; ""'^^ 

>Pl;AtXgS76B0 

"S^VPofpGWICLVA LLVU/3LLCI AYSFYFQSKV RKQGYIQUH raGPVIIIRIT 
FJLfSv gSsU* RHRIUa.SGI^ I.RWQENVCKW "VSNLGFAE 
LiAPLKMESG ALSGKKHRDT AGYIILYCl-P MLRLQLAVW. SESRIMGGSG SVV^©^ 

JtTYSRVIIDH DEVALCWPL LSTIU/3VFA AVLTAYI^ S^^^m^r, l ^?^DsS 
IFSVSSFLPL RIVMLCLSVL TAADKIIFEA LSFIAFLSLP CFCVVSICLL VYFPVSESMA 
LRG^DTODE DTAVTEERSG ALLLAPNSSQ TDEGLSLRGR RDSGSSTQER YVEUILFLEA 
EM* 

C- Length 362 aa 

C; Sequence Atlg576B0 was extracted from t81Z3 
C; Fragment (54480-53392(0) 
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, .TCCCCCTOA -^^^ ■ 

61 CTGCTAGTTC TrrTGGGTCT CCTTTGCATC GCTTACTCGT i^i^ CCGAATCACT 
121 CGTAAGCAAG GCTATATTCA RCTTGCTTAC J"^^^ 5StGM TTTGTTGAGG 
181 TTCRTTCTCT TTGCTATCTG GTGGgcTGTT GGTGAm TTGCAACTGG 
241 OCJTCACAGAA GGrTGTTG^ TGGCTXraRT EIS^S^^ ??SrCT CATOTTTCTT 
301 TACATCGTTT COATCTAGG ATTTGO^ S^SS GAAAATGGRA CAGGGACACA 
361 CTGOGTGCTC CCTTGAAGRT GOAATCAGGG GCTTTGAGCO ^^^^ qcttGTGTTG 
421 GCftGOTTATA TTATTCTTTA TTGTCTCCCG arcTGCCACA CGACTTCACA 

«i icCGAGTCAC GCCTAAATGG TGGIAGTGGC iSScCTCTA 
S41 AGAACGTATT CCCGAGTTAT TA^TOATCAC 55^^^ S^SxACTT GTTCTGGCTT 
601 CTOAGTACCA TCCTTCTTGG TGTGTTTGCA GCCCTCC^ SSgAGT ATACACTTTQ 
661 GGAAGGCAGA TACTGAAACT TGTCATTMlC AAJCGTrraC GTCGGTTCTC 
721 ATATTCTOSG TCTCQAGXTT CCTTCCATTA AGGMTGTTA CTCCCTCTTC 

781 aS^ acaaoattat attcga^ ^^^^ ^^CC 
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A. thaUarta-eonlig 18123- coordinates: 53392-M«Otc> y <~ 
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. th^iiam . contig 18123 - coordinates: S3392-54480 (C) 
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. ,tBtta.ct. tt.a..r.ct .^.ntt.aa, gtUt..c« ,gct.«t,t a.«c.ata. 
61 lgttai-.tg.%t ttgtatgTit^ naUtwSg gtctttgttt ci^aattnttg 

181 <itah;-.ttt:gg taagtctetg a"a.ttgtc 9"-^ gggtgatctg ggctgctttc 

241 tatc^agtgt t^aagagtcs ?^Icttg«« aglttcgtaa aaggttcaat 

361 ctttctccgt ttttcatcaa ^ff^t-^taa -^-^^-^95 j. tag agagcatgcg 

421 qcacattgca ctgctattgc ^mACT TCCCGRTGCR TTCGGCGTTG 

4B1 igg^taotgt ^t"tgggtg »^cc^rai_^^A gcttactcgt 

541 TGACGATATG TCTAGTCGCT =12^^?^^ ^SttS ACITGSTTAC TTCAGTGOTC 
601 TCTATTTCCA CSTCTCACGTT CGTWW^ SaTCTC GTOGGCTCTT GGTGAGArTT 
661 CCTOSATTAT CCQAATCACT TXC^^TCTCT TTCCXRT^ tcqcSTTGQAT CTGRCSATGGC 
721 TTCGATTGAG TTTCTTGA^ CGTCACAGAA ^^^^ JSgGAG CCTTGTCrCT 
781 AAGARAACGT TTGCAAGTGG ^CATCGTTT CCAMCTM^ gqraTCAGGG GCTTTGAGCG 
841 TTCTGACTCT CATGITTCTT CTGCCTGCTC ^J^^^ TTOTCTCCCG ATGCTTGCTC 
901 GAAAATGGAA CAGGGACA^ S^J^fSSc aCCTASlCG TGGTAGTGGC TCTTATG^ 
961 TTCAACTTGC GGTTGTGTTG TCCGAGT^ ccSl*T TATTCATCAC GACGAGGTGG 
1021 AGCTGCCACA CQACTTCACA AGAACGTATT CCroAGTT^ .^^^^ GCXX3TCCTAA 
1081 CCTTATGCaC *1ATCCTCTA CTGAOTACffl ^CCTTCTTOa ^.^^ *ASCGrTTM: 
1141 CACCTTACTT C3TTCTGGCTT ?SotT CCTTCCATTA AGOATTGTl^ 

1201 agaagagagt atacactttg atattctotg tctoba^ attcgaagcc ctttctttct 

1261 TGCTCTGTTT GTCGGTTCTC ACACCAGCAG CTGCTTGCTT GTCTACTTCC 

1321 TGGCCTTCCT CTCCCTCTTC TGCTTTTGCG TGGTATCCAT CTB GATAOSGCTG 
1381 CGGTTTCAGA TTCCATGGCC CTGAGAGGTC cTCTTCACAA ACTSATGAGG 

1441 TGACCGRAGA RCGCAGTGGT GCTCTGTT^ ^^5?^ ACAGGAGACG TATGTGGAAC 
1501 GATTGAGCTT AAGAGGTCGG ^GAGACTCGG GATCGTCiA ^trcta tttqgotxtt 

1S61 TCAGCCTATT TCTGGAAGCT °^^^^^^% H^^-^^V^ gqataaagag tttaagtacg 
1621 ggcaatgtac agartcctgg T.gaaa..^^g- tSatt^tg iggcaaaga-.- actttgaccg 

1741 taaaggaggg r-tlraaggggg "^tta^ncttg '-^^^^^ ^ tttgatttag aaaaaaaaaa 
1801 ttgaf.ggatc stacttttgt »^^^tnatt. t^^gat^a ^^^^^^^^^^ ttttacccto 
1861 aaaaaacaat agccgtgaac fl-^^a^^^ga -^^t „gagaaat.t 9Mcc8agca 

1921 tattotttgg f^t=«^^^-^ aalc?tcca° gtcaatwga cotagacaaa tacaottatg 
^oSi ^oSaaa ^^.tfg^cafa "^tLatat ?.aaccaagt ttg.caaga 
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